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PREFACE 


The  Earth  Observations  Division  version  of  the  Laboratory  for 
Applications  of  Remote  Sensing  System  reflects  the  efforts  of 
many  researchers  and  programmers  over  a period  of  several  years. 
Special  assistance  and  consultation  were  provided  to  the  authors 
by  K.  Eaker  and  A.  H.  Feiveson  of  the  Earth  Observations  Division 
of  the  National  Aeronautics  and  Space  Administration,  Lyndon  B. 
Johnson  Space  Center  and  N.  J.  Clinton,  W.  G.  Eppler,  T.  C. 

Min ter,  and  J.  A.  Quirein  of  Lockheed  Electronics  Company,  Inc. 

Some  of  the  software  incorporated  in  this  version  of  the  system 
includes  program  modules  developed  for  other  systems.  Two 
routines  from  the  Algorithm  Simulation  Test  and  Evaluation  Pro- 
gram were  provided  by  M.  T.  Li  of  Lockheed  Electronics  Company, 
Inc.,  and  J.  K.  Daly  of  the  TRW  Systems  Group.  In  addition, 
specific  routines  were  provided  by  A.  H.  Feiveson  to  compute 
thresholds  for  the  classification-by-thresholding  algorithm. 
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1 . INTRODUCTION 

l 

For  several  years,  the  Earth  Observations  Division  (EOD)  of  the 
National  Aeronautics  and  Space  Administration  (NASA) , Lyndon  B. 
Johnson  Space  Center  (JSC) , has  supported  research  for  the  devel- 
opment of  techniques  to  be  used  in  processing  remotely  sensed 
imagery  data  obtained  from  the  multispectral  scanner  (MSS)  placed 
aboard  various  aircraft  and  satellites.  One  of  the  earliest 

operational  computer  systems  to  use  pattern  recognition  tech- 

I 

niques  in  the  analysis  of  these  data  was  developed  at  Purdue 
University's  Laboratory  for  Applications  of  Remote  Sensing  (LARS). 
The  earliest  version  of  the  LARS  system  (LARSYS)  was  converted 
in  1970  to  a batch  program  for  execution  on  the  Univac  1108 
EXEC  2 system  at  NASA/ JSC  (ref.  1). 


The  computer  system  described  in  this  document  originated  from 
this  early  version  of  the  LARSYS.  However,  since  1970,  personnel 
of  the  EOD,  Lockheed  Electronics  Company,  Inc.  (LEC) , and  other 
EOD  support  contractors  have  made  many  modifications  and  improve- 
ments to  the  Univac  1108  version  of  the  LARSYS;  thus,  new  tech- 
niques have  been  developed  and  programmed  to  perform  additional 
functions  in  the  evaluation  of  the  data. 

Although  the  basic  structure  of  the  system  remains  the  same, 
a large  portion  of  it  has  been  reprogrammed.  Modifications  to 
existing  techniques  and  the  addition  of  new  techniques  have 
expanded  the  capabilities  of  the  system.  The  current  version 
is  called  the  EOD-LARSYS . 

The  purposes  of  this  document  are  to  define  the  capabilities 
and  limitations  of  the  system  and  to  provide  the  user  with  the 
information  needed  to  execute  the  program  and  obtain  the  output 
desired.  It  is  assumed  throughout  the  document  that  the  user 
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is  familiar  with  the  terminology  and  the  pattern  recognition 
techniques  involved.  No  attempt  is  made  to  assist  the  user  in 
the  analysis  of  data  output  by  the  system. 


2.  GENERAL  SYSTEM  DESCRIPTION 


The  EOD-LARSYS  is  a batch  processing  program  operational  on  the 
Univac  1108  EXEC  2 system  at  NASA/JSC*  The  system  is  composed 
of  a system  monitor  and  a set  of  processors,  each  of  which  per- 
forms a specific  function  in  the  analysis  of  MSS  imagery  data. 
Linkage  between  processors  is  accomplished  by  the  use  of  files 
or  card  decks.  The  execution  of  a particular  batch  job  may  begin 
or  end  with  any  processor,  provided  the  appropriate  files  or  card 
decks  are  furnished. 

Two  pattern  recognition  classification  schemes  are  provided  by 
the  system.  One,  the  supervised  classification  algorithm  known 
as  the  maximum  likelihood  classifier,  is  embodied  in  the  CLASSIFY 
processor  (ref.  2).  The  other,  an  unsupervised  classification  or 
clustering  algorithm,  is  embodied  in  the  Iterative  Self-Organizing 
Clustering  System  (ISOCLS)  processor  (ref.  3).  ISOCLS,  along 
with  other  processors,  may  be  used  to  "train"  the  maximum  likeli- 
hood classifier  or  to  display  the  results  of  classification. 

Having  obtained  an  MSS  image  data  tape  (DATAPE)  in  one  of  the 
allowable  formats  (see  section  3.2),  the  data  analyst  must  train 
the  classifier.  The  maximum  likelihood  classification  algorithm 
is  based  on  the  assumption  that  the  samples  within  a given  class 
are  distributed  according  to  a multivariate  normal  probability 
density  function.  Such  distribution  is  specified  completely  in 
terms  of  a mean  vector  and  a covariance  matrix,  which  must  be 
computed  from  known  samples  of  the  class  being  represented.  This 
implies  that  the  data  analyst  must  have  some  prior  knowledge  (i.e., 
ground-truth  information)  of  specific  areas  within  the  MSS  image. 
Using  this  ground  truth,  the  analyst  must  identify  training 
samples  for  computation  of  statistics.  To  aid  the  analyst  in 
locating  these  training  samples,  the  histogram  (HIST)  and 
GRAYMAP  processors  are  provided. 


The  HIST  processor  provides  a histogram  of  data  values  from  the 
MSS  image  for  use  by  the  GRAYMAP  processor.  HIST  may  also  be 
used  independently  to  provide  the  analyst  with  information  on 
the  distribution  of  data  values  within  specific  user-defined 
blocks  (or  fields)  of  the  image.  The  mean,  standard  deviation, 
and  range  of  data  within  each  user-defined  field  are  standard 
outputs  from  the  HIST  processor  for  each  requested  channel. 
Histogram  plots  may  be  obtained  optionally.  With  the  histogram 
information,  a file  (HISFIL)  is  written  automatically  for  the 
GRAYMAP  processor. 

The  GRAYMAP  processor  provides  the  analyst  with  a pictorial 
gray-scale  map  of  any  channel  of  the  MSS  image  for  use  in 
obtaining  training  field  coordinates.  The  map  is  labeled  by 
sample  and  scan-line  numbers.  From  this  map,  the  analyst  may 
locate  the  fields  within  the  image  for  which  he  has  ground- 
truth  data.  Having  identified  the  fields,  coordinates  (sample 
and  scan- line  numbers  at  each  vertex)  must  be  noted  for  defining 
the  fields  to  the  statistics  (STAT)  or  ISOCLS  processor.  Since 
the  output  device  used  to  make  the  line  printer  map  has  fewer 
symbols  than  the  256  gray  shades  allowed  by  the  MSS  data  resolu- 
tion, a reduction  in  the  number  of  gray  shades  must  be  made.  The 
map  produced  by  GRAYMAP  is  printed  with  up  to  16  different  sym- 
bols. The  analyst  may  define  the  symbols  and  corresponding  range 
from  the  histogram.  The  latter  method  allows  equal  activity  for 
each  of  10  standard  symbols  chosen  to  produce  a wide  range  of  gray 
shades . 

The  coordinates  for  the  training  fields  may  be  input  to  either 
the  STAT  or  ISOCLS  processor  for  computation  of  statistics  for 
the  classifier.  Both  processors  save  the  statistics  and  train- 
ing field  information  on  a file  (SAVTAP,  section  4.1)  for  use 
in  other  processors. 


In  using  the  STAT  processor,  the  user  must  group  training  fields 
into  statistically  similar  subclasses.  Subclasses  may  be  grouped 
further  into  classes.  For  example,  three  statistically  similar 
subclasses  of  spring  wheat,  winter  wheat,  and  harvested  wheat  may 
be  grouped  into  one  wheat  class.  Statistics  for  each  subclass 
are  maintained  on  the  SAVTAP  file,  along  with  the  class  grouping. 
Class  groupings  are  maintained  simply  for  convenience  in  defining 
categories  in  the  CLASSIFY  processor  and  for  performance  reports 
by  the  DISPLAY  processor.  The  analyst  may  obtain  the  following 
output  for  each  training  field  and/or  subclass. 

• Mean  vector 

• Covariance  matrix 

• Correlation  coefficient 

• Histogram  plots 

• Spectral  plots 

In  using  the  ISOCLS  processor,  the  user  must  group  training 
fields  into  classes.  The  clustering  process  breaks  the  class 
data  into  statistically  similar  subclasses  (clusters) . Sub- 
classes are  given  names  by  taking  the  first  three  characters 
of  the  class  name  and  three  digits  indicating  the  number  of 
the  subclass  within  the  class.  Again,  the  statistics  for  each 
subclass  are  saved  on  the  SAVTAP  file  for  use  in  other  proc- 
essors. ISOCLS  is  an  iterative  self-organizing  clustering 
procedure  which  uses  the  measure  of  absolute  (LI)  distance  from  a 
picture  element  (pixel)  to  the  cluster  center  to  determine  the 
similarity  of  pixels.  At  each  iteration  the  user  may  obtain  a 
cluster  summary  and  map.  Optionally,  a cluster  image  data  tape 
(MAPFIL)  may  be  output  in  either  LARSYS  II  or  Universal  format 
(appendixes  A and  B,  respectively) . 


Using  the  transform  statistics  (TRSTAT)  processor,  the  statistics 
on  the  SAVTAP  file  output  by  the  processor  STAT  or  ISOCLS  are 
transformed  according  to 


y ' = Ay  + b 
K'  = AKAT 


(2-1) 


where 

y ' = transformed  means 
A * a matrix 

y = means  from  SAVTAP  file 
b -a  vector 

K'  = transformed  covariance  matrix 
K = covariance  matrix  from  SAVTAP  file 

A = transpose  of  matrix  A 

The  transformed  statistics  are  output  as  a new  file  on  SAVTAP. 


Before  proceeding  to  classification  of  the  MSS  image,  it  may  be 
desirable  to  reduce  the  dimensionality  of  the  data  vectors  by 
selecting  a smaller  set  of  channels  or  a linear  combination  of 
the  channels  which  maximizes  some  class  separability  measure. 

The  SELECT  processor  (ref.  4)  provides  this  capability.  In  order 
to  compute  the  value  of  the  separability  measures,  the  statistics 
calculated  using  the  STAT  or  ISOCLS  processors  must  be  made 
available  to  the  SELECT  processor  either  by  card  deck,  tape,  or 
Fastrand  file.  The  SELECT  processor  allows  the  analyst  to  work 
with  subsets  of  the  statistics  on  the  file,  if  he  desires.  Sub- 
sets of  the  statistics  are  indicated  by  the  CHANNELS  and  SUBCLASS 
control  cards,  which  are  defined  further  in  section  10.4.3. 


In  addition,  the  statistics  for  two  or  more  subclasses  may  be 
grouped  together  and  considered  as  one  subclass.  Grouping  the 
statistics  for  two  or  more  subclasses  is  equivalent  to  going 
back  through  the  STAT  processor  and  combining  all  training 
fields  for  those  subclasses  being  grouped  into  one  subclass. 

The  grouping  option  is  exercised  via  the  GROUP  control  card 
defined  in  section  10.4.3.  The  subsets  and  groupings  of  the 
statistics  provided  to  the  SELECT  processor  for  computation 
are  used  only  in  SELECT  and  are  not  passed  on  to  other 
processors . 

The  SELECT  processor  also  allows  the  analyst  to  evaluate  a 
given  set  of  channels  using  one  of  three  different  separability 
measures  or  to  select  the  best  set  of  channels  (k)  out  of  the 
total  channels  (n)  based  on  one  of  these  separability  measures. 

The  three  separability  measures  provided  are: 

a.  Weighted  average  interclass  divergence 

b.  Weighted  average  transformed  divergence 

c.  Weighted  average  Bhattacharyya  distance 

To  select  the  best  set  of  k out  of  n channels,  the  analyst  may 
use  either  the  Without  Replacement  Search  Procedure  or  the 
Exhaustive  Search  Procedure.  A third  procedure  will  find  k 
linear  combinations  of  n measurements  which  extiremize  a given 
separability  measure.  This  procedure,  ifenown  as  the  Davidon- 
Fletcher-Powell  Procedure,  outputs  the  linear  combinations  in 
matrix  form.  All  the  procedures  and  equations  for  separability 
measures  referred  to  above  are  discussed  in  detail  in  reference  5., 

After  the  SELECT  processor  has  determined  the  subset  or  the  linear 
combination  of  channels  which  maximizes  subclass  separability,  the 
supervised  classification  of  the  imagery  data  is  performed  by  the 
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CLASSIFY  processor.  The  options  available  in  SELECT  for  grouping 
and  selecting  subsets  of  the  SAVTAP  file  are  also  available  in 
CLASSIFY. 

However,  once  the  statistics  for  classification  have  been  speci- 
fied, the  classes  and  subclasses  are  renumbered  and  referred  to 
in  the  DISPLAY  processor  by  the  new  numbers. 

The  CLASSIFY  processor  allows  the  user  to  group  classes  previ- 
ously defined  by  the  STAT  or  ISOCLS  processor  into  categories 
for  the  sum-of-densities  classification.  When  a category  is 
defined  (by  class  names) , all  subclasses  in  each  class  are 
assigned  to  the  category.  The  density  function  for  category  i, 
P^(x),  is  the  sum  of  densities  for  all  subclasses  in  the  cate- 
gory; that  is, 

ki 

pi(x)  = £ Pj  (x)  (2-  2) 

j=l 

where 

Pj  = the  probability  density  function  for  subclass  j 

k^  = the  number  of  subclasses  in  category  i 

(Note;  More  detailed  equations  are  given  in  section  11.) 

Pixel  x is  assigned  to  category  i if  P^(x)  > P^(x)  for  all  cate- 
gories k ^ i.  Pixel  x is  further  assigned  to  subclass  j if 
(1)  j belongs  to  category  i and  (2)  p.(x)  > p.(x)  for  all  sub- 

j K 

classes  k,  k / j,  and  k belongs  to  category  i. 

Obviously  a one-to-one  correspondence  between  categories  and 
subclasses  reduces  the  above  equation  to: 


When  this  is  the  case,  the  amount  of  computation  required  for 
classification  can  be  greatly  reduced  by  the  use  of  thresholds. 
CLASSIFY  then  has  two  procedures  for  classification  which  use 
this  computational  reduction  to  advantage.  The  sum-of-densities 
rule  is  used  only  when  categories  are  defined  by  the  user.  Other- 
wise, the  classification-by-thresholding  procedure  detailed  in 
reference  5 is  used. 

The  CLASSIFY  processor  writes  a file  (MAPTAP,  section  4.4  and 
appendix  C)  containing  the  subclass  number  and  confidence  level 
for  each  pixel  classified;  the  training  fields  and  statistics  for 
the  classes  and  subclasses  actually  used  in  classification;  and 
the  correspondence  between  categories,  classes,  and  subclasses. 

The  DISPLAY  processor  accepts  the  file  output  by  CLASSIFY  and 
generates  a line-printer  map  of  the  classified  data,  along  with 
several  performance  tables.  In  the  map,  each  subclass  has  a 
symbol  associated  with  it.  A threshold  option  is  provided  for 
the  analyst  to  print  no  symbol  (blank)  for  samples  classified 
with  a confidence  level  less  than  some  specified  threshold  value. 

Performance  summaries  are  provided  on  subclass,  class,  and 
category  levels  for  pixels  within  each  classified  field,  train- 
ing field,  and  test  field  which  are  input  to  the  DISPLAY  proc- 
essor. The  training  or  test  field  performance  summaries  may  be 
obtained  by  fields  and/or  classes.  The  DISPLAY  processor  also 
provides  optional  output  of  a classification  map  (MAPFIL)  on  tape 
in  either  Universal  or  LARSYS  II  format. 

The  data- transformation  (DATA-TR)  processor  allows  the  analyst 
to  use  the  linear  transformation  matrix  computed  by  SELECT  to 
create  a new  image  data  tape  (TRFORM) . Since  the  matrix  is  com- 
puted to  extremize  subclass  separability,  the  k linear  combina- 
tions out  of  n channels  represented  by  the  matrix  produce  better 


class  contrast  when  the  image  is  displayed;  that  is,  on  the  Data 
Analysis  Station  (DAS) . In  addition,  the  best  linear  combination 
of  the  data  can  be  used  to  enhance  the  image. 

The  TRFORM  tape  may  be  output  in  either  the  LARSYS  II  or  Universal 
format. 

The  NDHIST  processor  performs  an  n-dimensional  histogram  of  areas 
on  the  MSS  data  tape  (DATAPE) , for  which  the  user  wishes  to  create 
scatter  plots.  The  fields  may  be  histogrammed  on  a class,  sub- 
class, or  per-field  basis.  A line-printer  summary  of  the  fields, 
the  number  of  data  vectors  in  each  field,  and  the  number  of  unique 
data  vectors  histogrammed  is  given. 

Optionally,  if  a scatter  plot  of  a classified  or  clustered  area 
is  requested,  a classification  or  cluster  image  data  tape  (MAPFIL) 
from  the  DISPLAY  or  ISOCLS  processor  must  be  input  to  NDHIST.  If 
this  option  is  exercised,  the  field  or  fields  input  to  this  proc- 
essor and  their  order  of  input  must  be  the  same  as  those  input  to 
CLASSIFY  or  ISOCLS. 

Information  such  as  the  field,  cluster  or  subclass  number,  the 
frequency  of  occurrence,  and  color  code  for  each  histogrammed 
radiance  vector  is  written  on  the  n-dimensional  histogram  (NDIM) 
file. 

The  SCTRPL  processor  reads  the  NDIM  file,  and  a two-axis  color- 
coded  spectral  plot  (PLOTAP)  is  output  in  the  Universal  format. 
The  background  for  the  plot  may  be  black  or  white. 

If  more  than  two  channels  were  histogrammed  by  the  NDHIST  proc- 
essor, the  data  vector  is  reduced  to  two  components  by 

y ' = Ax  + b (2-4) 


where 


y ' * transformed  image 
A * matrix 
x = data  vector 
b = bias  vector 

The  location  on  the  scatter  plot  for  each  vector  in  the  NDIM 
file  is  determined  by  its  radiance  values  (if  only  two  channels 
were  histogrammed)  or  by  two  linear  combinations  of  radiance 
values  (if  more  than  two  channels  were  histogrammed) , 

The  color  for  the  pixel  is  assigned  by 

• Original  radiance  values 

• Mean  value  of  the  subclass  or  cluster  to  which  the  pixel  was 
assigned  during  classification  or  clustering 

• Mean  value  of  the  test  or  training  field  from  which  the  pixel 
was  extracted 

• User-defined  colors 

• Color  extraction  from  a different  pass  when  using  multi- 
registered  Landsat  data. 

Optionally,  for  pixel  color  assignment,  the  SAVTAP  file  created 
by  STAT  or  ISOCLS  may  be  input. 

Optionally,  a line-printer  pixel  frequency  or  log  of  pixel  fre- 
quency (base  2)  plot  is  given.  The  plot  is  printed  with  up  to 
16  different  symbols. 

Figure  2-1  is  a flow  diagram  of  the  EOD-LARSYS . 
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Figure  2-1.—  Flow  diagram  of  the  EOD-LARSYS 
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3.  SYSTEM  INPUT/OUTPUT  FORMATS 


» 

3.1  CARD  INPUT/OUTPUT 

Card  input  to  and  output  from  the  system  must  be  one  of  the  fol- 
lowing types. 

3.1.1  PROCESSOR  CARDS 

Processor  cards  identify  the  processor  that  is  to  be  executed. 
The  system  monitor  routine  calls  the  appropriate  processor, 
which  initiates  the  loading  of  all  routines  used  by  the  proc- 
essor. The  processor  card  is  always  a $ symbol  followed  by  the 
processor  name  and  must  always  be  punched  left  justified  begin- 
ning in  column  1.  No  blanks  are  allowed.  The  $ symbol  and  the 
first  five  characters  are  the  unique  processor  identification 
used  by  the  system  monitor  routine. 


Below  is  a list  of  all  processor  cards  recognized  by  the  system, 
along  with  the  section  in  which  each  processor  is  described. 


$HIST 

$GRAYMAP 

$STAT 

$ISOCLS 

$SELECT 

$CLASSIFY 

$D I SPLAY 

$DATA-TR 

$TRSTAT 

$NDHIST 

$SCTRPL 

$EXIT 


Section  6 
Section  7 
Section  8 
Section  9 
Section  10 
Section  11 
Section  12 
Section  13 
Section  14 
Section  15 
Section  16 

Execution  terminates  when  this  card  is 
encountered. 


3.1.2  CONTROL  CARDS 


Each  processor  has  its  own  set  of  control  cards  which  allow  the 
user  to  exercise  various  options  in  the  particular  processor  or 
to  change  the  default  value  assigned  to  certain  parameters  in  the 
system.  These  cards  must  immediately  follow  a processor  card. 

The  control  cards  are  identified  by  a keyword  in  columns  1 
through  10  of  the  card.  Only  the  first  six  characters  are  used 
for  testing.  In  columns  11  through  72,  the  parameter  values  or 
options  are  indicated.  These  columns  are  free  form,  blanks  are 
ignored  (unless  of  legitimate  parameter  value) , and  multiparameter 
values  or  options  are  separated  by  commas.  Columns  73  through  80 
of  the  card  are  not  used.  With  the  exceptions  of  the  *END* , $END* , 
and  in  some  cases  the  STATFILE  cards,  control  cards  may  occur  in 
any  order.  (The  STATFILE  control  card  exception  is  noted  in  the 
section  for  the  appropriate  processor.)  If  the  list  of  parameter 
values  for  a given  keyword  is  too  long  for  one  card,  the  remaining 
values  can  be  input  on  another  card  with  the  same  keyword.  (The 
continuation  of  a CATEGORY  control  card  is  slightly  different; 
see  section  11,  table  11-1.)  In  every  processor,  the  *END*  con- 
trol card  indicates  the  end  of  a set  of  control  cards,  and  the 
$END*  indicates  the  end  of  field  definition  card  input. 

3.1.3  CLASS,  SUBCLASS,  AND  FIELD  DEFINITIONS 

A field  is  a specific  block  of  data  to  be  extracted  from  the  input 
MSS  data  tape  (DATAPE)  and  processed.  It  is  defined  by  a sample 
increment,  a line  increment,  and  from  1 to  10  vertices.  Option- 
ally, the  user  may  associate  a name  with  each  field.  The  alpha- 
numeric field  description  is  located  in  columns  1 through  6.  In 
columns  11  through  72,  sample  and  line  increments  are  separated 
by  a comma  and  enclosed  in  parentheses.  A comma  separates  the 
increments  and  each  of  the  following  vertices.  The  vertices  must 
be  arranged  in  clockwise  order.  Sample  and  line  numbers  which 
describe  a vertex  are  separated  by  a comma  and  enclosed  in  paren- 
theses. The  sample  number  must  be  given  first  for  each  vertex. 


More  than  one  card  may  be  used  to  describe  a field.  An  asterisk 
occurring  after  a vertex  indicates  a continuation  card  is  to  be 
read  beginning  in  column  11.  A vertex  must  be  completed  on  a 
card  and  cannot  be  split  between  two  cards.  The  numbers  which 
describe  the  increments  and  vertices  must  be  integers. 

It  is  the  user's  responsibility  to  ascertain  that  all  defined 
fields  are  within  the  bounds  of  the  MSS  image.  In  determining 
which  pixels  belong  in  a particular  field , the  EOD-LARSYS  examines 
the  pixel  intercepts  of  each  scan  line  with  each  side  of  the  field. 
The  pixel  intercept  X,  with  the  scan  line  L and  the  side  defined 
by  vertices  (X^,Y^)  and  (X2 # Y2)  , is  calculate<*  by  the  equation: 


(L  - Y1)  (X2  - Xx) 

<Y2  - V 


(3-1) 


The  value  of  X is  computed  as  a f loating-point  number;  however, 
the  actual  pixel  intercept  must  be  an  integer  number.  Therefore, 
if  the  fractional  part  of  X is  greater  than  one-half,  the  pixel 
intercept  is  the  next  higher  integer  number.  If  the  fractional 
part  of  X is  less  than  one-half,  the  pixel  intercept  will  be  the 

next  lower  integer  number.  When  the  fractional  part  of  X is 

exactly  one-half,  the  integer  pixel  intercept  depends  on  the 
direction  of  movement  from  the  point  (X^,Y^)  to  (X2 , Y2)  . If  Y^ 

is  less  than  Y2,  the  pixel  intercept  is  the  next  higher  integer. 

If  Y^  is  greater  than  Y2,  the  pixel  intercept  is  the  next  lower 
integer  number. 

After  all  pixel  intercepts  for  a given  scan  line  have  been 
determined,  the  intercepts  are  taken  in  pairs  and  all  pixels 
between  and  including  the  pair  of  intercepts  are  included  in 
the  field.  In  the  following  example  for  scan  line  L,  all  pixels 
between  and  including  and  P^  are  included,  and  all  pixels 
between  and  including  P^  and  P^  are  included. 


In  example  2,  the  field  F2  has  the  same  vertices  as  FI;  however, 
the  sample  increment  is  2 and  the  line  increment  is  3. 


F2 


(2,3)  , (412,231) , (422,230) , (429,246) , (419,247) 

Sample 


Line 


230 

233 

236 

239 

242 

245 
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In  example  3,  the  sample  and  line  increments  for  field  W187  are 
equal  to  1,  and  there  are  six  vertices. 


W187  (1,1) , (324,266) , (335,264) , (338,271) , 

(330,272) , (329,269) , (326,269) 
Sample 


With  the  exception  of  SELECT,  TRSTAT , and  SCTRPL,  every  processor 
accepts  the  input  of  field  definition  cards.  Field  definitions 
are  always  input  between  the  *END*  and  the  $END*  control  cards. 

In  the  STAT  and  ISOCLS  processors,  fields  must  be  associated  with 
a class  or  subclass  name.  In  the  DISPLAY  processor,  fields  may 


// 


be  test  fields  or  designated  fields.  In  the  NDHIST  processor, 
fields  are  associated  with  class,  subclass,  test  or  training,  or 
any  user-defined  field. 

The  fields  defined  in  STAT  and  ISOCLS  are  called  training  fields, 
and  the  data  within  these  fields  are  used  for  computing  statistics. 
Training  fields  are  grouped  into  subclasses,  and  subclasses  are 
further  grouped  into  classes,  using  the  STAT  processor.  In  ISOCLS, 
training  fields  are  grouped  into  classes,  and  the  clustering  pro- 
cedure breaks  the  class  data  into  subclasses  (clusters) . To 
allow  for  these  groupings,  cards  bearing  a class  name  and  a 
subclass  name  are  necessary. 

A class  name  card  has  the  keyword  CLASSNAME  beginning  in  column  1 
and  the  alphanumeric  name  of  the  class  left  justified  in  columns  11 
through  16  of  the  card.  Blanks  should  not  be  embedded  in  the 
class  or  subclass  names. 

A subclass  name  card  has  the  keyword  SUBCLASS  beginning  in  col- 
umn 1 and  the  alphanumeric  name  of  the  class  left  justified  in 
columns  11  through  16  of  the  card. 

In  STAT,  a CLASSNAME  card  must  immediately  follow  the  *END* 
control  card.  The  CLASSNAME  card  is  followed  by  one  or  more 
SUBCLASS  cards,  each  of  which  must  be  followed  by  one  or  more 
field  definition  cards.  See  the  example  for  STAT  (section  8.4.4). 

In  ISOCLS,  a CLASSNAME  card  must  immediately  follow  the  *END* 
control  card.  The  CLASSNAME  card  is  immediately  followed  by 
one  or  more  field  definition  cards.  The  data  from  the  fields 
associated  with  a given  class  name  are  clustered  as  one  data 
set.  The  class  is  broken  into  subclasses  (clusters)  which  do 
not  have  field  boundaries.  So,  even  though  statistics  are 


computed  on  a subclass  level,  training  fields  cannot  be  asso- 
ciated with  subclasses  in  ISOCLS.  See  the  example  for  ISOCLS 
(section  9.5.4) . 

In  DISPLAY,  test  fields  (if  input)  must  be  identified  by  a pre- 
viously defined  class  or  subclass  name.  When  associated  with 
classes,  a CLASSNAME  card  should  immediately  follow  the  *END* 
control  card.  Test  fields  for  that  class  should  follow  imme- 
diately. When  associated  with  subclasses,  a SUBCLASS  card 
should  immediately  follow  the  *END*  card,  followed  by  the  test 
fields  for  that  subclass. 

Designated  fields  are  the  other  type  of  field  input  to  DISPLAY. 
Fields  may  be  designated  "unidentifiable"  or  "other. " For  input 
of  designated  fields,  a card  with  the  keyword  DESIGNATE  beginning 
in  column  1 and  the  keyword  OTHER  or  UNIDENTIFIABLE  beginning  in 
column  11  must  precede  the  field  definition  cards.  See  sec- 
tion 12.4.4  for  sample  input  of  test  and  designated  fields. 

3.1.4  SPECIAL  SYSTEM  DECKS 

The  card  decks  described  in  this  section  are  special  decks  nor- 
mally output  from  one  processor  to  be  used  at  some  future  time 
for  input  to  another  processor.  However,  if  the  user  can  obtain 
all  of  the  information  needed  for  any  of  the  special  card  decks 
from  some  other  source,  such  information  may  be  input  directly 
to  the  processor  if  the  formats  described  in  this  section  are 
followed. 

These  decks  are  always  included  in  the  deck  setup  with  the  control 
cards  for  the  particular  processor.  The  first  card  of  each  deck 
acts  as  a keyword  which  initiates  the  input  of  the  deck. 
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Upon  input  of  any  of  these  special  system  card  decks , a corre- 
sponding internal  file  is  written  with  the  information  read  from 
cards.  The  file  is  then  used  to  transfer  the  data  from  one  proc- 
essor to  another.  It  is  not  necessary  to  input  the  same  deck  to 
more  than  one  processor  in  the  same  run. 

3. 1.4.1  Module  STAT  Deck 

The  module  STAT  deck  is  optional  output  from  the  STAT,  ISOCLS, 
and  TRSTAT  processors.  It  contains  either  the  statistics  (mean 
vectors  and  covariance  matrices)  for  all  the  subclasses  input  to 
STAT  or  for  clusters  computed  by  ISOCLS  or  the  transformed  statis- 
tics for  all  subclasses  or  clusters  input  to  TRSTAT.  These  sta- 
tistics are  needed  in  the  computation  of  the  probability  density 
function  in  CLASSIFY  and  the  computation  of  separability  measures 
in  SELECT. 

This  deck  also  contains  all  the  training  field  boundaries,  the 
class  and  subclass  numbers  to  which  the  training  fields  belong, 
the  class  and  subclass  names,  the  number  of  subclasses  in  each 
class,  and  the  number  of  points  in  each  subclass  or  cluster.  By 
defining  the  required  training  fields  in  STAT,  the  user  has 
absolute  control  over  the  data  samples  which  will  define  a sub- 
class from  the  MSS  data  tape  (DATAPE) . Every  data  sample 
occurring  in  any  one  of  the  training  fields  defined  by  a partic- 
ular subclass  is  used  in  computing  the  mean  vector  and  covariance 
matrix  for  that  subclass. 

In  the  clustering  processor  ISOCLS,  the  user  has  no  control  over 
the  specific  samples  which  comprise  a cluster.  The  processor 
determines  which  data  samples  are  used  in  computing  the  mean 
vector  and  covariance  matrix  for  each  cluster.  Because  of  the 
desirability  of  using  these  cluster  statistics  in  other  proc- 
essors, the  ISOCLS  processor  punches  the  deck  and  creates  a file 
in  the  same  format  as  the  STAT  processor.  Training  fields  are 
associated  with  classes  rather  than  subclasses.  Clusters  are 


given  a six-character  name.  The  first  three  characters  are  the 
first  three  characters  of  the  class  name  associated  with  the 
cluster,  and  the  last  three  characters  are  digits.  The  digits 
for  the  subclasses  are  in  sequential  order. 

When  the  module  STAT  deck  is  input  to  the  CLASSIFY  or  SELECT 
processor,  the  user  may  request  subsets  of  the  statistics  to 
be  used  for  classification  or  channel  selection  via  the  CHANNELS 
and  SUBCLASS  control  cards  in  both  processors.  Subclasses  are 
numbered  as  they  were  input  to  STAT,  and  clusters  are  numbered 
as  they  were  created  in  ISOCLS.  The  channels  are  numbered  as 
they  occur  on  the  MSS  data  tape  (DATAPE) . To  select  a subset  of 
the  statistics  in  the  module  STAT  deck,  the  user  should  indicate 
by  number  the  subclasses  and/or  channels  he  wishes  to  use. 

(Unless  the  user  has  previous  knowledge  of  the  number  of  clusters 
in  the  module  STAT  deck,  he  or  she  cannot  accurately  select  a 
subset  of  the  clusters  when  executing  ISOCLS  back  to  back  with 
another  processor.) 

The  first  card  in  the  module  STAT  deck  acts  as  a control  card, 
with  the  keyword  MODULE  initializing  the  input  of  the  remainder 
of  the  deck.  The  entire  deck  is  composed  of  the  card  types 
listed  below.  All  integers  should  be  right  justified  in  the 
specified  field,  and  alphanumeric  characters  should  be  left 
justified  in  the  specified  field. 

• Card  type  1 — Keyword  MODULE  in  columns  1 through  6. 

• Card  type  2 — Number  of  classes,  subclasses,  channels,  fields, 
and  vertices  for  training  fields. 

Columns  Type/format  Definition 

7—10  Integer/14  Number  of  training  classes  from  STAT  or 

ISOCLS 

19—20  Integer/12  Number  of  training  subclasses  from  STAT 

(clusters  from  ISOCLS) 


Columns  Type/format 


Definition 


29—30  Integer/12  Number  of  channels  used  in  computation 

of  statistics 

38—40  Integer/13  Number  of  training  fields  input  to  STAT 

or  ISOCLS 

49—52  Integer/14  Number  of  vertices  in  all  the  training 

fields 

Card  type  3 — Actual  channels  used  in  computation  of  statistics 


Columns  Type/format 

11—12  Integer/12  Channel  1 

13—14  Integer/12  Channel  2 

15—16  Integer/! 2 Channel  3 


Definition 


69—70  Integer/12  Channel  30 

Card  type  4 — Training  field  information:  The  first  card  of 

the  set.^ 


Columns  Type/format 

1—6  Alphanumeric/A6  Field  name 


Definition 


11-12  Integer/12 


21-22  Integer/12 


T-’imber  of  the  class  associated  with 
this  field 

Number  of  the  subclass  associated 
with  this  field  input  to  STAT  (Since 
ISOCLS  associates  fields  with  classes 
ISOCLS  dummies  this  information  by 
setting  it  equal  to  zero.) 


Card  types  4 and  5 define  a training  field.  To  complete  the  set 
of  information  for  one  training  field,  one  card  of  type  4 and 
one  or  two  cards  of  type  5 are  required.  The  number  of  card 
sets  is  determined  by  the  number  of  training  fields. 
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Columns  Type/format  Definition 

31—32  Integer/12  Number  of  vertices  for  this  field, 

including  closure  point 

• Card  type  5 — Vertices  for  the  training  field:  Up  to  10  ver- 

tices plus  the  closure  point  are  allowable  for  each  training 
field,  7 vertices  per  card  with  coordinates  ordered  (sample, 
line) . The  coordinates  are  listed  in  a clockwise  manner, 
with  the  coordinate  having  the  smallest  sample  number  listed 
first. ^ 


Columns 

Type/format 

Definition 

11-15 

Integer/15 

Sample  number  of  first  vertex 

16-20 

Integer/15 

Line  number  of  first  vertex 

21-25 

Integer/I5 

Sample  number  of  second  vertex 

26-30 

• 

• 

Integer/15 

• 

Line  number  of  second  vertex 

• 

• 

76-80 

• 

Integer/15 

• 

Line  number  of  the  seventh  vertex 

• Card  type  6 — Class  names,  nine  names  per  card,  left  justified 
ia  field:  The  number  of  cards  is  determined  by  the  number  of 

classes . 

Columns  Type/format  Definition 

11—16  Alphanumeric/A6  Six-character  class  name  for  first 

class 

19—24  Alphanumeric/A6  Six-character  class  name  for  second 

class 


2 ' 

Card  types  4 and  5 define  a training  field.  To  complete  the  set 

of  information  for  one  training  field,  one  card  of  type  4 and 

one  or  two  cards  of  type  5 are  required.  The  number  of  card 

sets  is  determined  by  the  number  of  training  fields. 


Columns  Type/format 


Definition 


27—32  Alphanun\eric/A6  Six-character  class  name  for  third 

class 


I 

X -• 


75-80 


Six-character  class  name  for  the 
ninth  class 


Card  type  7 — Number  of  subclasses  in  each  class,  24  per  card: 
The  number  of  cards  is  determined  by  the  number  of  subclasses. 


Columns 

Type/format 

Definition 

9-10 

Integer/I2 

Number 

of 

subclasses 

in 

first  class 

12-13 

Integer/12 

Number 

of 

subclasses 

in 

second  class 

15-16 

• 

Integer/12 

• 

Number 

of 

subclasses 

• 

• 

in 

third  class 

• 

78-79 

• 

Integer/12 

Number 

of 

• 

subclasses 

in 

24th  class 

Card  type  8 — Subclass  names,  10  per  card:  The  number  of 

cards  is  determined  by  the  number  of  subclasses. 


Columns  Type/format 
9—14  Alphanumeric/A6 

16—21  Alphanumeric/A6 

23—28  Alphanumeric/A6 


Definition 

Six-character  subclass  name  for  first 
subclass 

Six-character  subclass  name  for 
second  subclass 

Six-character  subclass  name  for  third 
subclass 


72—77  Alphanumeric/A6  Six-character  subclass  name  for 

10th  subclass 


3XT2 

$5" 


* V 
* 


• * w- 


To  complete  the  set  of  statistics  for  one  subclass,  the  following 
three  types  of  cards  are  grouped  together.  The  number  of  sets  of 
cards  is  determined  by  the  number  of  subclasses. 

• Card  type  9 — Number  of  points  in  this  subclass. 

Columns  Type/format  Definition 

13—20  Integer/18  Number  of  points  in  this  subclass 

• Card  type  10  — Mean  vector  for  this  subclass,  five  values  per 

card:  The  number  of  cards  is  determined  by  the  number  of 

channels . 

Columns  Type/format  Definition 


6-20 

Real/E15. 8 

Mean 

for 

first 

channel 

for 

this 

subclass 

21-35 

Real/El5. 8 

Mean 

for 

second 

channel 

for 

this 

subclass 

36-50 

Real/El5 . 8 

Mean 

for 

third 

channel 

for 

this 

subclass 

51-65 

Real/El5 . 8 

Mean 

for 

fourth 

channel 

for 

■ this 

subclass 

66-80 

Real/E15 . 8 

Mean 

for 

fifth 

channel 

for 

this 

subclass 

• Card  type  11  — Covariance  matrix  for  this  subclass:  Only  the 

lower  triangular  portion  of  the  matrix  is  punched;  the  number 
of  values  input  for  this  matrix  is  equal  to  (number  of 
channels)  * (number  of  channels  + l)/2.  Five  values  are 
punched  on  each  card  in  the  order  indicated. 

1 

2 3 

4 5 

7 8 


6 

9 10 


3 


Definition 


Columns 

Type/format 

Def  i 

,nj 

Ltic 

Ml 

6-20 

Real/El5 . 8 

Element 

1 

of 

matrix 

21-35 

Real/E15 . 8 

Element 

2 

of 

matrix 

36-50 

Real/E15 . 8 

Element 

3 

of 

matrix 

51-65 

Real/E15 . 8 

Element 

4 

of 

matrix 

66-80 

Real/E15. 8 

Element 

5 

of 

f atrix 

3. 1.4. 2 

B-Matrix  Deck 

This  deck  is  an  optional  output  of  the  SELECT  processor  when  the 
Davidon-Fletcher-Powell  Procedure  is  used.  The  deck  contains  a 
transformation  matrix  which  extremizes  a given  separability  meas 
ure  for  the  subclasses  being  used.  The  matrix  is  optimized 
using  the  Davidon-Fletcher-Powell  Procedure.  The  linear  trans- 
formation of  the  original  measurements  can  be  used  in  the 
CLASSIFY,  SCTRPL,  TRSTAT,  or  DATA-TR  procedure  to  reduce  the 
dimensionality  of  the  data  and/or  statistics. 

The  B-matrix  deck,  or  corresponding  file,  is  an  optional  input 
to  SELECT,  SCTRPL,  and  CLASSIFY  and  a required  input  to  DATA-TR 
and  TRSTAT  (see  A-MATRIX  control  card  in  section  14.4.3).  When 
input  to  SELECT,  the  matrix  is  used  to  evaluate  a specific  sepa- 
rability measure  or  it  is  used  as  a first  guess  for  the  Davidon- 
Fletcher-Powell  Procedure,  depending  on  the  user's  request.  Whe 
input  to  CLASSIFY,  classification  is  performed  using  the  linear 
transformation.  When  input  to  SCTRPL,  the  dimension  of  the  data 
from  the  MSS  data  tape  (DATAPE)  is  reduced  to  two  linear  com- 
binations. When  input  to  TRSTAT,  a new  file  containing  the 
transformed  statistics  is  created  on  SAVTAP.  The  DATA-TR  proc- 
essor uses  the  matrix  to  create  a new  image  tape  with  the 
reduced  dimensionality. 


The  keyword  B-MATRIX  on  a control  card  indicates  that  the 
B-matrix  is  being  input.  Since  the  matrix  may  be  on  cards  or 
file,  the  parameter  CARDS  or  FILE  must  be  punched  on  the  same 
card  in  columns  11  through  72.  The  entire  card  deck  is  defined 
below  by  card  types. 

• Card  type  1 — The  keyword  B-MATRIX  in  columns  1 through  10 
and  CARDS  in  columns  11  through  72  initialize  input  of  the 
card  deck. 


• Card  type  2 — One  card  of  this  type. 


Columns 

Type/format 

Definition 

6-7 

Integer/12 

Number  of 

linear  combinations. 

13-14 

Integer/12 

Number  of 

channels . 

18-80 

Integer/3012 

The  remainder  of  this  card  lists  by 

number  the  channels  for  which  the  matrix 
was  computed;  e.g.,  columns  18  through 
19,  first  channel,  etc.,  for  a maximum 
of  30  channels  right  justified  in  the 
field. 


• Card  type  3 — The  number  of  these  cards  is  determined  by  the 
size  of  the  matrix.  The  values  are  input  by  column  as  indi- 
cated below,  five  values  per  card. 


B(k,n),  k = linear  combinations;  n = channels 


B = 


1 (k  + 1)  • • • 

2 (k  + 2)  ••• 

3 (k  + 3) 


[nk  - (k  - 1)] 
[nk  - (k  - 2)] 
[nk  - (k  - 3) ] 


Columns 

Type/format 

Definition 

6-20 

Real/E15. 8 

Element 

1 

of  matrix 

21-35 

Real /El 5. 8 

Element 

2 

of  matrix 

36-50 

Real/E15 . 8 

Element 

3 

of  matrix 

51-65 

Real/E15 . 8 

Element 

4 

of  matrix 

66-80 

Real/El5 . 8 

Element 

5 

of  matrix 

(Continued  on  next  card) 


3. 1.4. 3 Cluster  Means  Deck 

This  deck  is  an  optional  input  to  the  clustering  processor 
ISOCLS . It  may  be  used  to  initialize  the  clustering  process  by 
estimating  cluster  centers  (means) . The  means  can  be  taken  from 
the  module  STAT  deck  (see  section  3. 1.4.1)  punched  by  either 
TRSTAT,  STAT,  ISOCLS,  or  the  user.  Means  may  be  input  for  up  to 
30  channels  for  each  cluster  center,  and  a subset  of  the  chan- 
nels to  be  used  may  be  indicated  on  the  CHANNELS  control  card. 

The  keyword  MEANS  in  the  control  cards  for  ISOCLS  indicates 
initial  cluster  means  are  being  input.  Since  the  means  may  be 
on  cards  or  from  a file,  the  keyword  CARDS  or  FILE  must  be 
punched  on  the  same  card  in  columns  11  through  72.  If  on  cards, 
CARDS  initializes  input  of  the  cluster  means  deck  which  must 
immediately  follow.  The  format  for  the  entire  card  deck  is 
indicated  below. 

• Card  type  1 — Control  card  keyword  MEANS  is  left  justified  in 
columns  1 through  5.  The  keyword  CARDS  in  columns  11  through 
72  initializes  input  of  the  card  deck. 


• Card  type  2 — Number  of  clusters  and  channels 


Columns 

Type/format 

6-10 

Integer/15 

25-30 

Integer/15 

Card  type  3 — Actual 

Columns 

Type/format 

6-7 

Integer/12 

8-9 

Integer/12 

10-11 

Integer/12 

Definition 

Number  of  initial  clusters  for  which 
means  are  provided 

Number  of  channels  for  which  means  are 
provided 

channels  used  in  computation  of  means. 
Definition 

Channel  1 
Channel  2 
Channel  3 


64—65  Integer/12  Channel  30 


• Card  type  4 — Mean  vectors  for  the  initial  clusters:  These 

cards  are  in  the  same  format  as  the  means  cards  (card  type  10) 
in  the  module  STAT  deck.  The  first  mean  for  each  cluster 
always  begins  on  a new  card.  The  number  of  cards  depends  on 
the  number  of  channels  and  the  number  of  clusters.  Five 
values  are  placed  on  each  card. 


Columns  Type/format 
6-20  Real/El 5. 8 

21-35  Real/El 5. 8 

36-50  Real/El5 . 8 
51-65  Rea 1/El 5. 8 

66-80  Real/El5 . 8 


Definition 
Mean  for  channel  1 
Mean  for  channel  2 
Mean  for  channel  3 
Mean  for  channel  4 


Mean  for  channel  5 

(Continued  on  consecutive  cards  of  the 
same  format) 


3.2  MSS  IMAGE  DATA  TAPES 


Every  processor  except  SELECT,  DISPLAY,  TRSTAT , and  SCTRPL 
requires  the  input  of  an  MSS  data  tape  (DATAPE) . The  tape 
assignment  defaults  to  logical  unit  C (Fortran  unit  3) , but  the 
user  may  assign  any  unit  available  by  input  of  the  DATAFILE  con- 
trol card.  For  details,  see  the  file  assignment  chart  in 
section  4 and  the  control  card  section  for  each  processor. 

The  tape  may  be  in  either  the  LARSYS  II  (or  III)  format  or  the 
Universal  format.  These  formats  are  defined  in  appendixes  A 
and  B,  respectively. 

The  control  card  DATAFILE  allows  the  user  to  communicate  the 
file  number  of  the  MSS  data  tape  (DATAPE)  to  be  processed  and  the 
logical  unit  assignment.  This  is  optional  input  to  every  proc- 
essor that  requires  the  MSS  data  tape.  The  first  f ile  ,of  the 
tape  will  be  processed  unless  otherwise  specified  by  the  DATAFILE 
control  card.  In  executing  the  same  and/or  different  processors 
back  to  back,  the  DATAFILE  control  card  may  be  input  only  to  the 
first  processor  executed  if  the  same  file  and  logical  unit  are 
to  be  used  throughout  the  execution.  For  example, 

$HIST  [File  2 of  the  MSS  data  tape 

DATAFILE  UNIT=3 ,FILE=2  (DATAPE)  assigned  to  unit  C is 

: (Other  control  cards)  P^efed  by  GRAYMAP  as  ”el1  as 

♦END* 

(Field  definition) 

$END* 

$ GRAYMAP 
CHANNELS  5,6 
♦END* 

(Field  definitions) 

$END* 

The  user  should  be  aware  of  the  following  Univac  1108  EXEC  2 
system  options  in  assigning  an  MSS  data.  A tape  is  always 
assigned  to  a seven-track  drive  unless  the  user  requests  in 


writing  that  the  operator  assign  the  tape  to  a nine-track  drive. 

If  the  user's  tape  is  in  fact  nine  track,  he  must  inform  the 
operator  on  NASA/JSC  form  588,  which  he  submits  with  the  computer 
run.  In  addition,  the  Univac  ASG  card  in  the  deck  setup  must  have 
the  N-option  punched  on  it.  Since  most  nine-track  tapes  will  be 
coming  from  a source  other  than  the  Univac  system,  the  A-option, 
which  allows  frame-count  errors,  should  be  punched  on  the  ASG 
card  as  well.  This  option  should  also  be  used  on  seven-track 
tapes  from  sources  other  than  the  Univac  computer.  For  example, 

@NA  ASG  C= (number  of  nine-track  data  tape  from  source  other  than 
Univac) 

@A  ASG  C= (number  of  seven-track  data  tape  from  source  other  than 
Univac) 

NOTE:  @ is  the  Univac  control  punch.  It  is  a 7-8  multipunch 

in  column  1 of  the  card. 
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4.  SYSTEM  INTERNAL  FILES 


The  files  described  in  this  section  are  used  internally  by  the 
system  to  pass  information  between  processors  * It  is  the  user's 
responsibility  to  assign  the  necessary  files  for  his  particular 
job.  On  the  Univac  1108  EXEC  2 system,  files  are  assigned  in 
the  runstream  with  the  ASG  system  command. 


For  example,  to  assign  logical  units  A and  B to  Fastrand,  the 
following  commands  are  used. 


@ ASG  A 
@ ASG  B 


§ ASG  A,B 


To  assign  logical  unit  B to  tape,  the  following  command  is  used. 

@ ASG  B=(tape  number) 

When  a required  file  is  not  assigned  in  the  runstream  or 
dynamically  by  the  program,  the  EXEC  2 system  will  default  the 
assignment  to  the  FH432  drum. 

Since  the  FH432  drum  is  the  highest  speed  input/output  device 
available,  the  EOD-LARSYS  system  uses  it  for  random  access 
scratch  input/output.  If  the  user  fails  to  assign  one  of  the 
files  indicated  below  to  Fastrand  or  tape,  the  file  will  be 
written  on  the  FH432  drum  and  will  probably  be  destroyed  by 
some  later  scratch  input/output. 

4.1  STATISTICS  FILE  (SAVTAP,  UNIT  A) 

This  file  must  be  assigned  to  either  Fastrand  or  tape  whenever 
one  or  more  of  the  processors  STAT,  SELECT,  CLASSIFY,  ISOCLS , 
NDHIST,  SCTRPL , TRSTAT,  or  DATA-TR  are  executed.  One  file  is 
written  on  this  unit  for  each  execution  of  STAT,  TRSTAT,  or  ISOCLS 


or  for  input  of  a module  STAT  deck  to  some  other  processor.  The 
file  contains  the  same  information  as  itemized  in  section  3. 1.4.1 
for  the  module  STAT  deck. 

Multiple  files  may  be  written  on  a single  tape  and  accessed  by 
using  the  STATPILE  control  card.  This  control  card  communicates 
the  file  number  for  positioning  the  tape  and  the  logical  unit 
assignment.  The  first  file  is  always  assumed  unless  otherwise 
specified  by  the  user,  and  the  unit  assignment  assumes  logical 
unit  A (Fortran  unit  1)  unless  otherwise  specified  by  the  STATFILE 
control  card.  In  executing  several  processors  back  to  back  and 
in  referencing  the  same  file,  only  one  STATFILE  control  card  need 
be  submitted.  If  different  file  numbers  are  to  be  referenced 
during  one  execution,  then  the  file  number  may  be  changed  from 
one  processor  to  the  next  by  input  of  the  STATFILE  control  card 
to  each  processor.  For  example, 

$STAT 

STATFILE  UNIT=1,FILE=2 
(Other  control  cards) 

♦END* 

(Class,  subclass,  and  field  definitions) 

$END* 

$CLASSIFY 

*END* 

(Fields  to  be  classified) 

$END* 

The  STAT  processor  will  write  the  training  statistics  for  this 
run  on  file  2 of  the  SAVTAP  file  (unit  A).  (The  system  files, 
their  logical  units,  and  assignments  are  set  out  in  table  4-1.) 
CLASSIFY  will  use  all  of  the  statistics  on  file  2 of  the  tape 
for  classification. 

The  following  example  shows  assignments  for  back-to-back  execu- 
tions Of  STAT,  ISOCLS , and  SELECT. 


$STAT 

STATFILE  UNIT=1 , FILE=2 
(Other  control  cards) 

♦END* 

(Class,  subclass,  and  field  definitions) 

$END* 

$ISOCLS 

STATFILE  UNIT=1,FILE=3 
(Other  control  cards) 

♦END* 

(Class  and  field  definitions) 

$END* 

$SELECT 

STATFILE  UNIT= 1 , F ILE=2 

BEST  4 

♦END* 

$END* 

STAT  will  write  on  file  2 of  the  tape  assigned  to  unit  A,  ISOCLS 
will  write  on  file  3 of  the  same  tape,  and  SELECT  will  go  back 
to  file  2 of  unit  A for  the  statistics  computed  by  STAT. 


4.2  B-MATRIX  FILE  (BMFILE , UNIT  H) 

The  BMFILE  contains  the  transformation  matrix  which  corresponds 
to  the  B-matrix  deck  (section  3. 1.4. 2).  The  deck  or  file  is  an 
optional  input  to  SELECT,  SCTRPL,  and  CLASSIFY  and  a required 
input  to  DATA-TR  and  TRSTAT.  When  the  card  deck  is  input  to  any 
of  these  processors,  this  file  is  automatically  written.  The 
B-matrix  is  computed  by  the  SELECT  processor  and  automatically 
output  to  the  file  when  the  Davidon-Fletcher-Powell  Procedure 
is  executed. 


The  file  must  be  assigned  to  Fastrand  or  tape. 


4.3  ONE-DIMENSIONAL  HISTOGRAM  FILE  (HISFIL,  UNIT  K) 

On  logical  unit  K,  the  HIST  processor  creates  the  HISFIL  file, 
which  is  used  by  the  GRAYMAP  processor.  The  file  should  be 
assigned  to  tape  or  Fastrand. 
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4.4  CLASSIFICATION  MAP  FILE  (MAPTAP,  UNIT  B) 


The  MAPTAP  file  (appendix  C)  must  be  assigned  to  either  Fastrand 
or  tape  when  the  CLASSIFY  and  DISPLAY  processors  are  executed. 

The  file,  which  is  output  by  CLASSIFY,  contains  the  statistics 
actually  used  in  the  classification,  the  training  field  informa- 
tion, and  ail  of  the  classification  results. 

It  is  suggested  that  this  file  be  assigned  to  tape  and  saved 
when  CLASSIFY  is  executed.  This  will  allow  the  user  to  make 
several  runs  of  DISPLAY  without  classifying  every  time. 

4.5  N-DIMENSIONAL  HISTOGRAM  FILE  (NDIM,  UNIT  D) 

The  NDHIST  processor  writes  the  NDIM  file  to  be  used  as  an 
interface  to  the  SCTRPL  processor.  The  tape  default  assignment 
is  unit  D,  but  the  user  may  assign  any  available  unit.  (See 
file  assignment  chart,  table  4-1.)  The  NDIM  format  is  defined 
in  appendix  D. 

4.6  TRANSFORMED  STATISTICS  FILS  (SAVTAP,  UNIT  A) 

The  TRSTAT  processor  writes  the  transformed  statistics  on  the 
SAVTAP  file.  (See  section  4.1  for  further  information.) 

4.7  SCRATCH  FILES 

The  EOD-LARSYS  uses  the  available  high-speed  FH432  and  FH1782 
drums  for  scratch  input/output.  SELECT,  ISOCLS,  DISPLAY,  NDHIST, 
and  SCTRPL  use  the  scratch  input/output  files. 


5.  SYSTEM  OUTPUT  TAPE  FILES 


5.1  MAP  TAPE  (MAPFIL,  UNIT  N) 

On  logical  unit  N (Fortran  unit  16)  the  DISPLAY  processor 
optionally  outputs  a multifile  data  tape  (MAPFIL)  containing 
the  subclass  number  to  which  each  corresponding  pixel  was 
assigned  during  classification  by  CLASSIFY.  Also,  on  logical 
unit  N,  the  ISOCLS  processor  outputs  a MAPFIL  containing  either 
the  cluster  number  (OPTION  CLUSTER  control  card)  or  the  mean 
vector  to  which  each  corresponding,  pixel  was  assigned  during 
clustering.  A color  key  containing  the  color  code  for  each 
cluster  is  given  for  the  mean  vectors.  The  color  codes  option- 
ally may  be  ordered  according  to  the  cluster  number  or  to 
greenness  (OPTION  ORDER  control  card).  (See  section  9.5.3, 
table  9-1,  for  ISOCLS  control  cards.) 

The  results  of  the  classification/clustering  may  be  displayed  on 
the  Passive  Microwave  Imaging  System  (PMIS)  or  the  Bendix  100 
DAS . The  tape  must  be  mounted  on  a nine-track  tape  drive  com- 
patible with  the  DAS  and  may  be  output  in  either  the  LARSYS  II 
or  Universal  format.  The  display  may  be  made  without  the  color 
keys  (appendixes  A and  B)  or  with  color  keys  (see  appendix  E for 
tape  format) . To  exercise  this  option,  see  FORMAT  control  card 
(table  9-1)  for  the  ISOCLS  and  section  12  (table  12-1)  for  the 
DISPLAY  processor.1 

One  file  is  written  on  the  output  tape  for  each  field  classified 
or  clustered. 

5.2  SCATTER  PLOT  DATA  TAPE  (PLOTAP,  UNIT  E) 

The  SCTRPL  processor  outputs  two-axis  color-coded  spectral  plots 
on  a multifile  Universal-formatted  tape.  The  tape  default 

^These  data  are  available  as  input  to  NDHIST  via  seven-  or  nine-track  tape 
or  Fastrand. 
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assignment  is  unit  E,  but  the  user  may  assign  any  available 
unit.  (See  file  assignment  chart,  table  4-1.)  The  tape  format 
is  defined  in  appendix  F. 

5.3  transformed  data  tape  (TRFORM,  unit  L) 

The  DATA-TR  processor  outputs  a multifile  image  tape  of  trans- 
formed data.  The  image  tape  may  be  produced  in  either  the 
LARSYS  II  or  Universal  format  defined  in  appendixes  A and  B, 
respectively.  The  tape  must  be  assigned  to  logical  unit  L 
(Fortran  unit  14). 

File  assignments  for  all  processors  are  given  in  table  4-1. 


6.  ONE-DIMENSIONAL  HISTOGRAM  PROCESSOR  - HIST 


The  processor  HIST  computes  individual  field  histograms  and  a 
total  histogram  for  all  the  fields  and  channels  defined  by  the 
user.  An  individual  statistics  report  is  printed  for  every 
field  histogrammed.  The  report  contains  field  descriptions, 
data  ranges,  means,  standard  deviations,  and  normalized  ranges 
(mean  ±3  standard  deviations) . 

A cumulative  histogram  of  all  the  fields  is  calculated  and 
written  on  an  internal  file  to  be  read  later  by  the  GRAYMAP 
processor.  Like  the  field  histograms,  a statistics  report  is 
printed  for  the  combined  fields. 

The  input  DISPLAY  control  card  allows  the  user  to  obtain  a line 
printer  plot  of  the  histograms.  A histogram  for  each  channel  on 
the  DISPLAY  card  (described  in  table  6-1)  is  displayed  for 
each  field,  along  with  a cumulative  histogram  for  all  the  fields. 

6. 1 INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input.  The  tape  assignment 
defaults  to  logical  unit  C (Fortran  unit  3) , but,  by  input  of  the 
DATAFILE  control  card,  the  user  may  assign  any  available  logical 
unit.  (See  table  4-1  for  file  assignments  and  section  3.2,  MSS 
Image  Data  Tapes,  for  further  information  on  format.) 

6.2  OUTPUT  FILES 

The  HIST  processor  writes  a file  for  the  GRAYMAP  processor  on 
logical  unit  K (Fortran  unit  13) . This  file  (HISFIL)  contains 
the  histogram  data  for  each  channel  requested.  HISFIL  may  be 
assigned  to  Fastrand  or  tape  or  be  defaulted  (no  assignment)  to 
the  FH432  drum. 


6.3  SCRATCH  FILES 


The  HIST  processor  does  not  require  an  additional  scratch  file. 
6.4  CARD  INPUT 

The  formats  for  all  system  card  input  are  defined  in  section  3.1. 

6.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$HIST 

This  card  directs  the  system  monitor  routine  to  select  the  HIST 
processor  and  causes  all  the  routines  used  by  the  HIST  processor 
to  be  loaded  into  the  system. 

6.4.2  SPECIAL  SYSTEM  DECKS 

The  HIST  processor  does  not  use  any  special  input  decks. 

6.4.3  CONTROL  CARDS 

Control  cards  allow  the  user  to  input  various  options.  These 
cards  are  identified  by  a keyword  left  justified  in  columns  1 
through  10  of  the  card,  with  the  parameter  values  or  additional 
keywords  in  columns  11  through  72  (beginning  in  any  column 
after  column  10) . These  control  cards  may  be  in  any  order,  but 
they  must  be  the  first  cards  after  the  processor  card  $HIST. 
Table  6-1  lists  all  available  options,  along  with  their  default 
values . 


6.4.4  FIELD  DEFINITIONS 


Fields  to  be  histogrammed  are  input  immediately  following  the 
*END*  control  card.  The  card  column  format  for  field  definitions 
is  defined  in  section  3.1.2.  Input  of  field  definition  cards  is 
terminated  by  the  $END*  control  card. 

6.4.5  DECK  SETUP 

The  HIST  deck  setups  are  given  in  figure  6-1.  The  @ indicates 
the  master  punch  for  the  Univac  system.  Card  column  1 is  the 
7-8  multipunch. 

6.5  CARD  OUTPUT 

This  processor  does  not  output  any  card  decks. 

6.6  SAMPLE  COMPUTER  RUNS 

The  sample  test  cases  shown  in  figures  6-2  and  6-3  illustrate  the 
output  from  the  HIST  processor  using  all  defaults  for  one  run  and 
all  options  for  another,  respectively.  Two  fields  from  a Hill- 
County-North  data  tape  were  used  in  each  case. 

In  the  first  run,  four  channels  were  histogrammed  using  all 
defaults . 

In  the  second  run,  three  channels  were  histogrammed  but  only  two 
were  displayed.  The  minimum  radiance  XLOW  was  set  to  zero  while 
the  maximum  radiance  XHXGH  was  set  to  110.  The  height  of  the 
Y-axis,  YSIZ , was  set  to  12.  This  run  took  1.089  minutes. 
Appendix  G gives  execution  times  for  the  HIST  and  other 
processors . 

6.7  RESTRICTIONS 

a.  The  maximum  number  of  channels  is  30. 


b.  The  number  of  histograms  requested  to  be  plotted  may  be 

limited  if  internal  dimensions  are  too  small  for  all  user 
requests.  (For  example,  if  the  user  requests  30  channels 
to  be  histogrammed,  only  14  of  those  histograms  may  be 
plotted;  however,  all  30  will  be  histogrammed.) 

| This  limitation  is  a function  of  the  number  of  channels 

requested  on  the  CHANNELS  control  card.  If  too  many  chan- 
nels are  indicated  on  the  DISPLAY  control  card,  a diagnostic 
is  printed  but  execution  continues. 

i c.  The  DISPLAY  card  must  be  a subset  of  the  CHANNELS  card. 

i 

' d.  The  data  for  all  channels  for  one  scan  line  are  unpacked 

| into  an  array  dimensioned  12  000.  If  the  number  of  channels 

times  [(sample  end  - sample  begin) /sample  increment]  exceeds 
12  000,  a diagnostic  message  is  printed.  Sample  end  is  reset 

i to  fit  the  dimensions  and  execution  continues. 

\ 

r 

6 . 8 DIAGNOSTIC  MESSAGES 

The  diagnostic  messages  and  the  subroutines  in  which  they  appear 
are  as  follows . 


6.8.1  SUBROUTINE  HISTIC 
Message 

ONLY  THE  FIRST  50  FIELD 
DESCRIPTIONS  WERE  PRINTED, 
BUT  ALL  THE  FIELDS  WERE 
INCLUDED  IN  THE  TOTAL  HISTO- 
GRAMMED STATS. 


Explanation 

The  user  has  input  more  than 
50  fields,  and  only  the  first 
50  field  descriptions  will  be 
printed  in  the  "Data  Blocks 
Histogrammed"  portion  of  the 
total  report;  however,  all  the 
input  fields  were  included  in 
the  calculations  of  the  "Total 
Histogrammed  Statistics. 


P 


II 


6.8.2  SUBROUTINE  SETUPS 


Message 

CHANNEL , 12,  IS  NOT  A SUBSET 
OF  THE  CHANNELS  GIVEN  ON 
CHANNELS  CARD. 


Explanation 

A channel  on  the  DISPLAY  card  is 
not  a member  of  the  set  of  chan- 
nels on  the  CHANNELS  card. 


TOO  MANY  CHANNELS  ARE  BEING 
HISTOGRAMMED  AND  PLOTTED, 
NO.  OF  CHANNELS  PLOTTED  WAS 
RESET  TO 


User  requested  too  many  histograms 
to  be  plotted.  The  number  of  histo 
grams  plotted  varies  according  to 
the  number  of  channels  histogrammed 


6.8.3  SUBROUTINE  HISTGM 
Message 

TOO  MUCH  DATA  REQUESTED  — 
SAMPLE  END  WAS  RESET  TO 


Explanation 

The  data  for  all  channels  for  one 
scan  line  are  unpacked  into  an 
array  dimensioned  12  000.  If  the 
number  of  channels  times  [ (sample 
end  - sample  begin) /sample  incre- 
ment] exceeds  12  000,  this  diag- 
nostic is  printed.  Sample  end  is 
reset  to  fit  the  dimensions  and 
execution  continues. 


TABLE  6-1.-  HIST  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

Parameter  and 
default  values 

Function 

(a) 

(b) 

CHANNELS 

Cl'C2'C3'*"'Ck 

k<30 

Default:  None 

Channels  to  be  histogrammed, 
C^,C2fC2, • • • fC^,  should  be 
integer  numbers  separated  by 
commas • 

SIZE 

XHIGH=K 

0<K<255 

Default:  XHIGH=255 

K is  an  integer  which  sets  the 

maximum  radiance  value  which 

will  be  histogrammed.  XHIGH 

becomes  X„  on  the  X-axis  of 

max 

the  histogram  plot.c 

SIZE 

XLOW=J 
0<J<XHIGH 
Default:  XLOW=0 

J is  an  integer  which  sets  the 

minimum  radiance  value  which 

will  be  histogrammed.  XLOW 

becomes  X ■ „ on  the  X-axis  of 
mm 

the  histogram  plot. 

SIZE 

YSIZ=L 
0<L<£ (x) max 
Default:  YSIZ=15 

L is  an  integer  which  sets  the 

height  of  the  Y-axis  (number 

of  print  lines) . Using  the 

input  YSIZ , the  Y-axis  scale 

for  the  histogram  plot  will  be 

determined  by  the  processor  to 

be:  f (x)  +(YSIZ-1) /YSIZ . 

max 

DISPLAY 

C^fC2»C2»  * * * / 
k<  30 

Default:  No  plots 

Channels  for  which  histograms 
will  be  plotted.  C^,C2,C3, 
•••fC^  must  be  a subset  of 
the  CHANNELS  card. 

aThe  keyword  must  be  left  justified  in'  card  columns  1 through  10. 

bThe  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 
c 

The  difference  between  XHIGH  and  XLOW  must  be  at  least  100. 


TABLE  6-1.—  Continued 


Keyword 

DATAFILE 


HEDl 


HED2 


DATE 


COMMENT 


Appendix 

card. 


Parameter  and 

default  values  Function 


UNIT=N,FILE=M  N is  the  Fortran  logical  unit 

Default:  N=3,M=1  number  to  which  the  MSS  data 

tape  (DATAPE)  has  been  assigned; 

M is  the  file  number  for  the  tape 
to  be  processed.  For  back-to- 
back  executions  of  more  than  one 
processor,  if  using  the  same 
file  number,  only  one  DATAFILE 
control  card  need  be  submitted.*^ 


Any  60  characters  Replaces  first  header  line  with 
beginning  in  column  11  the  indicated  characters  in  the 
Default:  LYNDON  B.  parameter  field. 

JOHNSON  SPACE  CENTER 


Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON, 

TEXAS 

Any  12  characters 
beginning  in  column  11 
Default:  Current  date 


Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 


Prints  the  indicated  12  charac- 
ters in  the  right  corner  of  the 
heading  in  place  of  the  current 
date. 


Any  60  characters  Prints  a comment  line  using  the 

beginning  in  column  11  60  characters  found  in  the 

Default:  No  comments  parameter  field, 

printed 


contains  a sample  run  demonstrating  use  of  the  DATAFILE  control 


TABLE  6-1.—  Concluded 


Parameter  and 

Keyword  default  values  Function 


*END* 

Blank 

Signals  the  end  of  the  control 
cards. 

$END* 

Blank 

Signals  the  end  of  all  card 
input  for  the  processing 

function. 
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@ XQT  CUR 


@ ASG  K 


e ASG  C= (MSS  DATAPE  number) 


(s  ASG  Z= 


(program  tape  number) 


@N  MSG  TAPES  2 


@Z  RUN  — 


(Front  of  dock) 


Assigns  HISFIL  to  Fastrand 
File  will  default  to  high- 
speed drum  if  this  card  is 
omitted. 


(a)  For  independent  execution. 

Figure  6-1.—  Deck  setup  for  the  HIST  processor. 


$STAT  /\ (Training  field  cards) 


(Front  of  dock) 

Figure  6-1. — Concluded. 
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Figure  6-2.—  Sample  program  listing  and  output  for  the 
HIST  processor  using  all  defaults. 
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Figure  6-2.—  Continued 
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Figure  6-3.-  Sample  program  listing  and  output  for  the 
HIST  processor  using  all  options. 
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Figure  6-3.—  Continued 
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Figure  6-3.—  Continued 
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Figure  6-3.—  Continued 
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Figure  6-3.—  Continued 
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Figure  6-3.—  Continued 


Figure  6-3.-  Concluded 


7 . GRAYMAP  PROCESSOR 


The  chief  purpose  of  GRAYMAP  is  to  produce  alphanumeric  pictorial 
printouts  of  digitized  airborne  MSS  data.  Each  data  sample  is 
eight  bits,  providing  256  possible  gray  levels  for  each  MSS  data 
channel.  To  allow  a meaningful  distinction  in  gray-scale  tones, 
GRAYMAP  assigns  each  of  the  256  levels  to  1 of  as  many  as  16  pos- 
sible symbols.  These  symbols  may  be  preassigned  or  arbitrarily 
assigned  for  each  run.  The  specifications  for  the  bin  edges  for 
each  symbol  may  be  assigned  arbitrarily  by  the  user  for  each  run 
or  computed  from  the  histogram  data  in  order  to  result  in  equal 
activity  for  each  of  the  symbols.  In  any  case,  the  data  are 
subsequently  output  in  terms  of  symbols , and  each  symbol  repre- 
sents a range  of  data  values  in  which  the  corresponding  data 
points  fall. 


7.1  INPUT  FILES 

An  MSS  data  tape  (DA-TAPE)  must  be  input  to  the  GRAYMAP  processor. 
The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may 
assign  any  available  logical  unit.  (See  table  4-1  for  file 
assignments  and  section  3.2,  MSS  Image  Data  Tapes,  for  further 
information  on  format.) 

The  GRAYMAP  processor  requires  the  bin  levels  to  be  input  on  a 
control  card  or  computed  from  the  histograms  output  by  the  HIST 
processor  on  the  HISFIL  file.  When  the  bin  levels  are  to  be  com- 
puted, logical  unit  K (Fortran  unit  13)  may  be  assigned  to  either 
Fastrand  or  tape  or  allowed  to  default  to  drum  (no  assignment) . 

If  the  HIST  processor  has  not  been  executed  prior  to  running 
GRAYMAP  and  bin  levels  have  not  been  input,  a default  histogram 
of  every  10th  line  for  500  lines  and  every'^Oth  sample  for  200 
samples  is  computed,  and  HISFIL  i_s  created  on  logical  unit  K. 


Figure  7-1  shows  the  interaction  of  the  HIST  and  GRAYMAP 
processors. 

7.2  OUTPUT  FILES 

No  files  are  output  by  the  GRAYMAP  processor. 

7.3  SCRATCH  FILES 

The  GRAYMAP  processor  does  not  require  additional  scratch  files. 

7 . 4 CARD  INPUT 

All  system  card  input  formats  referred  to  in  this  section  are 
defined  in  section  3.1. 

7.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$ GRAYMAP 

This  card  directs  the  system  monitor  routine  to  select  the  GRAYMAP 
processor  and  initiates  loading  of  routines  used  by  GRAYMAP. 

7.4.2  SPECIAL  SYSTEM  DECKS 

None  of  the  special  system  card  decks  are  required  for  this 
processor. 

7.4.3  CONTROL  CARDS 

Table  7-1  lists  all  available  options  and  control  cards  recognized 
by  GRAYMAP,  along  with  their  default  values. 

7.4.4  FIELD  DEFINITIONS 

Fields  for  which  gray-scale  maps  are  desired  must  follow  the 
*END*  control  card.  See  section  3.1.3  for  the  format  of  field 


• A-**,* 


definition  cards.  Field  definition  input  is  terminated  by  the 
$END*  control  card. 

7.4.5  DECK  SETUP 

The  GRAYMAP  deck  setups  are  given  in  figure  7-2.  The  @ indicates 
the  master  punch  for  the  Univac  system,  which  is  the  7-8  multi- 
punch. The  deck  setup  in  figure  7-2 (a)  allows  HISFIL  on  unit  K 
to  default  to  drum. 

7.5  CARD  OUTPUT 

The  GRAYMAP  processor  produces  no  card  output. 

7 . 6 SAMPLE  COMPUTER  RUNS 

Test  case  1 for  GRAYMAP  (fig.  7-3)  illustrates  the  output  when 
the  bin  levels  and  two  sets  of  symbols  are  input.  Two  channels 
from  a Hill-County-North  data  tape  were  input,  but  only  the 
output  from  channel  1 is  shown. 

Test  case  2 for  HIST  and  GRAYMAP  (fig.  7-4)  included  channels  1 
and  2.  The  bin  levels  were  calculated  from  the  histogrammed  data 
and  default  symbols  were  used.  HIST  took  0.748  minute  while 
GRAYMAP  took  2.215  minutes.  All  the  output  from  HIST  is  shown, 
but  only  channel  1 output  from  GRAYMAP  is  included.  Appendix  G 
gives  sample  execution  times  for  the  HIST,  GRAYMAP,  and  other 
processors . 

7.7  RESTRICTIONS 

The  system-related  restrictions  in  section  17  apply  to  this 
processor . 

The  maximum  number  of  channels  allowed  is  30,  and  the  maximum 
number  of  bin  levels  is  16. 


7^3^ 


7.8  DIAGNOSTIC  MESSAGES 


BAD  SUPERVISOR  CONTROL  CARD.  Check  spelling  of  keyword. 

ONLY  16  BINLEVELS  PERMITTED.  Reduce  the  number  of  bin  levels  to 

16. 

THIS  CHANNEL  IS  NOT  Check  CHANNELS  control  card  and 

HISTOGRAMMED.  make  sure  all  channels  requested 

have  been  histogrammed. 

THIS  CHANNEL  IS  OUT  OF  All  channels  requested  must  be  in 

NUMERICAL  RANGE  AND  WAS  the  range  1 to  30. 

IGNORED. 


YOU  HAVE  ASKED  FOR  TOO  MANY  The  last  sample  is  reset  to  the 

SAMPLES.  THE  LAST  SAMPLE  IS  last  sample  on  the  data  tape. 


TABLE  7-1.-  GRAYMAP  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


CHANNELS 


BINLEVEL 


SYMBOLS 


DATAFILE 


Parameter  and 
default  values 

(b) Function 


C1'C2'C3 
k<  30 


Default:  Gray  map  for 
all  channels  on  HISFIL 
(created  by  a previous 
execution  of  HIST) 


Provides  pictorial  printout 
for  requested  channels. 


Nl,N2,N3,...,Nk 

k<16 

Default:  Histograms 

used  to  set  bin  levels 


Upper  bin  edges  for  gray- 
scale levels  with  a range  of 
0 to  255  and  a maximum  of 
16  levels;  the  last  bin  level 
should  always  be  255. 


S1'S2,S3' ‘ * * 'Sk 
k<16 

Default:  Two  sets  of 

10  symbols  overprinted, 
resulting  in  one  of 

9,X,0,0,*,=,*,— ,/ ,b 


Character  set  separated  by 
commas,  with  a maximum  of 
16  symbols  per  SYMBOL  card. 
If  2 sets  are  input,  the 
second  overprints  the  first. 
The  number  of  symbols  input 
on  one  card  determines  the 
number  of  bin  levels  when 
using  the  histograms  to  set 
the  levels.  Blank  is  a 
legitimate  character. 


UNIT=N , FILE=M 
Default:  N=3,M=1 


N is  the  Fortran  logical  unit 
number  to  which  the  image  data 
tape  has  been  assigned;  M is 
the  file  number  on  the  tape  to 


a 

The  keyword  must  be  left  justified  in  card  columns  1 through  10. 

b • 

The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 

in  any  column  past  10) . 


TABLE  7-1.—  Concluded. 


Keyword 

HED1 

HED2 

DATE 

COMMENT 

★ END* 
$END* 


Parameter  and 

default  values  Function 


be  processed.  For  back-to-back 
executions  of  more  than  one 
processor,  if  using  the  same 
file  number,  only  one  DATAFILE 
control  card  need  be  submitted. 


Any  60  characters  Replaces  first  header  line 

beginning  in  column  11  with  the  indicated  characters 
Default:  LYNDON  B.  in  the  parameter  field. 

JOHNSON  SPACE  CENTER 


Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON, 

TEXAS 


Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 


Any  12  characters 
beginning  in  column  11 
Default:  Current  date 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 


Prints  the  indicated  12  char- 
acters in  the  right  corner  of 
the  heading  in  place  of  the 
current  date. 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 


Blank 


Blank 


Signals  the  end  of  the  control 
cards . 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 


(Back  of  dock) 


i$EXIT 


|$END* 

(Variable  number  of  field 
definition  cards) 


*END* 


(Control  cards) 


$GRAYMAP 


§ XQT  LARSAA 


TRW  Z 


IN  Z 


TRW  Z 


XQT  CUR 


13  ASG  C= (MSS  DATAPE  number) 
ASG  Z=  (program  tape  number) 

/ 

{ 3 MSG  TAPES  2 


(Front  of  deck) 


(a)  For  independent  execution. 

Figure  7-2.—  Deck  setup  for  the  GRAYMAP  processor, 


| @ XQT  LARSAA 


(: 


/' 


( 

( 

f 

|@  ASG  K 


TRW  Z 


IN  Z 


@ XQT  CUR 


L 


f@R  ASG  C=  (MSS  DATAPE  number) 


r 


@R  ASG  Z= (program  tape  number) 


@N  MSG  TAPES  2 FASTRAND  1 


Assigns  HISFIL  to  Fastrand. 
If  this  card  is  omitted,  the 
file  will  default  to  drum. 


RUN  — 

(Front  cf  dock) 


(b)  For  execution  back  to  back  with  HIST. 
Figure  7-2.-  Continued. 
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Figure  7-3.-  Program  listing  and  output  for  the  GRAYMAP  processor  with  bin  levels 

and  two  sets  of  symbols  input. 
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Figure  7-3.—  Continued 
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Figure  7-3.—  Continued 
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Figure  7-3.—  Continued 
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Figure  7-3.—  Continued 
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Figure  7-3.—  Continued 
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Figure  7-3.-  Concluded 
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Figure  7-4.-  Continued 
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Figure  7-4.—  Continued 
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Figure  7-4 Concluded 


8.  STATISTICS  PROCESSOR  - STAT 


The  statistics  processor  STAT  computes  the  multichannel  means, 
standard  deviations,  covariance  matrix,  and  correlation  coeffi- 
cient for  each  training  field  and  all  training  subclasses  which 
are  defined  through  user  input  to  the  processor.  In  addition,  at 
the  user's  option,  histograms  and  spectral  plots  may  be  computed 
for  each  field  and/or  subclass. 

The  STAT  processor  requires  user  input  of  both  punched  cards  and 
an  MSS  data  tape  (DATAPE) . Card  input  consists  of  an  optional 
number  of  cards  from  the  set  of  control  cards  defined  in 
table  8-1  and  the  training  field  definitions  described  in 
section  3.1.3.  The  required  input  MSS  data  must  encompass  the 
area  of  interest  specified  in  the  training  field  definitions. 

The  processor  is  activated  and  initialized  by  a specific  proc- 
essor card  defined  in  section  8.4.1.  All  processor  functions 
which  are  available  as  options  to  the  user  are  directed  by  means 
of  the  input  control  cards  or  the  built-in  defaults  for  any 
control  card  which  is  not  input. 

In  addition  to  the  optional  printouts  under  the  direction  of  the 
control  cards,  the  STAT  processor  creates  the  output  file  SAVTAP, 
which  contains  the  computed  statistics  (mean  vector  and  covari- 
ance matrix)  for  each  training  subclass.  The  training  subclass 
statistics  optionally  are  output  on  punched  cards  (the  module 
STAT  deck) . Both  the  output  statistics  file  SAVTAP  and  the  out- 
put module  STAT  deck  are  in  a format  acceptable  to  the  statistics 
input  requirements  of  other  processors  in  the  EOD-LARSYS. 

Figure  8-1  gives  the  functional  flow  of  the  STAT  processor. 


The  mean  vector  for  the  ith  subclass  is  computed  as  follows: 


ui  = xli'x2i'”-'Xpi'"‘,XPi 

where 

Ni 

— 1 

Xpi  = Xpj  = avera9e  value  in  channel  p for  subclass  i 

1 D=1 

p = channel  number. 

P = largest  channel  number. 

- number  of  samples  in  all  training  fields  for  subclass  i 

X . = the  j th  sample  of  the  MSS  data  for  channel  p (a  value 

P 3 

between  0 and  255) . 

= mean  vector  for  the  ith  subclass. 

The  covariance  matrix  for  the  ith  subclass  is  computed  as 
follows: 


q = channel  number 


4 


Closely  related  statistics  are  the  standard  deviation  and  cor- 
relation coefficient  for  the  ith  subclass,  which  are  computed 
as  follows: 


api  ^ppi^ 


1/2 


(8-3) 


pqi 


J&L 


(k__.k_  . ) 
ppi  qqx 
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where 


k . = element  of  the  covariance  matrix  for  subclass  i;  the 

pqi 

variance  between  channels  p and  q. 
cr  ^ = standard  deviation  in  channel  p for  subclass  i;  p * q. 

= correlation  coefficient  between  channels  p and  q for 
subclass  i. 


8.1  INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input  to  the  STAT  processor. 
The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may 
assign  any  available  logical  unit.  (See  table  4-1  for  file 
assignments  and  section  3.2,  MSS  Image  Data  Tapes,  for  further 
information  on  format.) 


8.2  OUTPUT  FILES 

The  STAT  processor  always  outputs  the  statistics  on  the  SAVTAP 
file  and,  optionally  (by  means  of  the  OPTION  PUNCH  control  card), 
provides  the  module  STAT  deck.  (See  section  3.1. 4.1  for  format 
of  card  deck.)  The  only  user  action  required  for  the  card  output 
is  a notation  on  NASA/JSC  form  588,  which  is  submitted  with  the 
run  deck,  to  the  effect  that  punch-card  output  is  being  generated 
in  the  run. 


?£ 


The  required  output  file  SAVTAP  will  contain  the  class  names, 
subclass  names,  and  the  subclass  statistics  (mean  vector  and 
covariance  matrix)  computed  by  the  STAT  processor  for  every 
subclass  defined.  The  output  statistics  file  must  be  assigned 
to  either  Fastrand  or  tape.  The  tape  assignment  defaults  to 
logical  unit  A (Fortran  unit  1) ; however,  by  input  of  the 
STATFILE  control  card,  the  user  may  assign  any  available  unit. 

(See  table  4-1  for  file  assignments  and  table  8-1  for  control 
card  description.) 

Using  the  STATFILE  control  card  the  statistics  from  more  than  one 
execution  of  STAT  may  be  saved  on  the  same  tape. 

8.3  SCRATCH  FILES 

The  STAT  processor  does  not  require  the  use  of  a separate 

scratch  file. 

8.4  CARD  INPUT 

The  specific  card  column  formats  for  the  information  to  be  input 
on  the  processor  and  control  cards  are  given  in  sections  3.1.1 
and  3.1.2.  Table  8-1  describes  the  complete  set  of  keywords 
and  option  parameters  recognized  and  acted  upon  by  the  STAT 

processor. 

If  possible,  each  keyword  and  its  option  parameters  are  to  be 
completely  contained  on  one  control  card.  However,  if  more  param- 
eters are  required  than  can  be  contained  on  one  card,  the  control 
card  may  be  repeated  and  the  parameters  continued  on  the  next  con- 
trol card.  The  parameters  for  a control  card  of  a given  type  will 
be  cumulative  over  all  cards  of  that  type. 


a 


The  control  cards  follow  the  $STAT  processor  card.  All  options 
available  on  the  STAT  processor  have  a default  setting  which  is 
used  by  the  processor  for  those  option  parameters  not  input  via 
control  card.  The  control  card  *END*  must  be  input  to  signify 
the  end  of  the  set  of  control  cards.  Immediately  following  the 
♦END*  card,  a set  or  sets  of  class  names,  subclass  names,  and 
training  field  definition  cards  must  be  input.  See  section  8.4.4 
for  further  details  on  training  field  definitions. 

8.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example , 

$STAT 

This  card  directs  the  system  monitor  routine  to  select  the  STAT 
processor  and  initiates  the  input  of  STAT  processor  control 
cards . 

8.4.2  SPECIAL  SYSTEM  DECKS 

The  processor  does  not  expect  input  of  any  of  the  system-generated 
card  decks  described  in  section  3.1.4. 

8.4.3  CONTROL  CARDS 

Table  8-1  gives  the  complete  set  of  control  cards  which  the  user 
may  input  to  direct  the  STAT  processor  functions  and  the  default 
functions  performed  by  the  processor.  With  the  exception  of  the 
♦END*  and  $END*  control  cards,  the  sequence  of  the  control  cards 
is  optional.  The  *END*  card  must  immediately  follow  the  last 
control  card,  if  any;  the  CLASSNAME,  SUBCLASS  name,  and  training 
field  definition  cards  must  immediately  follow  the  *END*  card; 
and  the  $END*  card  must  immediately  follow  the  last  card  of  the 
input  training  field  definitions. 


8.4.4  CLASS,  SUBCLASS,  AND  FIELD  DEFINITIONS 

All  class,  subclass,  and  field  definition  cards  occur  between 
the  *END*  and  $END*  control  cards.  The  formats  for  these  cards 
are  given  in  section  3.1.3.  Training  fields  are  grouped  into 
statistically  similar  subclasses,  and  subclasses  are  grouped 

further  into  classes. 

A training  class  is  defined  to  the  processor  by  one  card  contain- 
ing the  keyword  CLASSNAME  in  columns  1 through  9.  The  user- 
determined  alphanumeric  name  to  be  assigned  to  the  class  begins 
in  column  11  and  may  contain  a maximum  of  six  characters  (through 
column  16) . At  least  one  CLASSNAME  card  must  be  input. 

A CLASSNAME  card  must  be  followed  by  at  least  one  subclass 
grouping.  A subclass  grouping  is  on  a SUBCLASS  card  followed 
immediately  by  one  or  more  field  definition  cards.  All  fields 
defined  by  field  definition  cards  following  the  SUBCLASS  card 
will  contribute  a cumulative  sample  set  from  which  the  training 
subclass  statistics  will  be  computed  for  the  named  subclass. 

The  set  of  cards  — one  SUBCLASS  card  followed  by  one  or  more 
field  definition  cards  — generates  the  statistics  for  one  train- 
ing subclass.  The  number  of  sets  of  SUBCLASS  and  field  definition 
cards  is  determined  by  the  number  of  sets  of  training  subclass 
statistics.  The  number  of  training  fields  to  be  defined  for  one 
given  subclass  is  not  restricted.  The  following  example  shows 
the  grouping  of  subclasses  into  classes. 
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$STAT 


* (Control 
*END* 


cards) 


CLASSNAME 

SUBCLASS 


• (Fields) 


SUBCLASS 


• (Fields) 


CLASSNAME 

SUBCLASS 


• (Fields) 


CLASSNAME 

SUBCLASS 


; (Fields) 
SUBCLASS 
; (Fields) 
SUBCLASS 
\ (Fields) 
$END* 


fcT7> 


CLASS 1 
SUB  11 


SUB  11  (of  CLASS 1) 


SUB  12 


SUB  12  (of  CLASS1) 


CLASS 2 
SUB  21 


SUB  21  (of  CLASS 2 ) 


CLASS 3 
SUB  31 


SUB  13  (of  CLASS 3) 


SUB  32 


SUB  32  (of  CLASS3) 


SUB  33 


SUB  33  (of  CLASS3) 


....  . a- 


8.4.5  DECK  SETUP 


The  STAT  deck  setup  is  given  in  figure  8-2.  The  % in  column  1 
indicates  the  master  punch  for  the  Univac  system,  which  is  the 

7-8  multipunch. 


8.5  CARD  OUTPUT 

The  STAT  processor  will  optionally  output  on  punched  cards  the 
module  STAT  deck  (section  3. 1.4.1).  The  module  STAT  deck 
contains  the  training  field  vertices,  the  subclass  names  for 
each  training  subclass,  the  subclass  numbers  assigned  to  each 
training  subclass,  the  association  of  training  fields  to  each 
subclass  and  class,  and  the  computed  statistics  for  each  train- 
ing subclass. 

The  module  STAT  deck  is  output  by  the  processor  only  on  demand 
specified  by  user  input  of  the  OPTION  PUNCH  control  card. 

8.6  SAMPLE  COMPUTER  RUNS 

The  sample  computer  run  shown  in  figure  8-3  illustrates  the 
output  from  the  STAT  processor  using  most  available  user  options. 
The  output  was  generated  using  a Hi 11 -County -North  data  tape, 

28  channels  per  scan  line.  Figure  8-4  is  a program  listing 
using  only  defaults. 


In  the  first  run  (fig.  8-3) , the  options  not  used  included 
NOCOVAR,  HIST=F , COVAR=C,  and  COVAR=F.  The  NOCOVAR  option 
would  suppress  all  output  normally  obtained  under  the  COVAR 
options.  The  HIST=F  option  would  have  displayed  a histogram  of 
all  the  fields.  The  COVAR=C  option  would  print  out  the  subclass 
statistics,  whereas  the  COVAR=F  option  would  print  out  the 
field  statistics;  the  COVAR  option  generates  the  same  output 
as  these  two  combined.  The  typical  output  for  a field  or  sub- 
class is  shown  for  each  option  illustrated.  The  run  generated 
similar  output  for  every  field  and  every  subclass  defined. 
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The  second  run  (fig.  8-4)  demonstrates  the  use  of  all  defaults 
Output  from  this  run  is  not  shown. 


8.7  RESTRICTIONS 

The  system- re la ted  restrictions  in  section  17  apply  to  the  STAT 
processor. 

In  addition,  a core  storage  limitation  is  associated  with  the 

total  storage  required  by  the  training  subclass  statistics  and 

the  various  options  (i.e.,  producing  histograms  and  spectral 

plots) . The  upper  limit  on  core  storage  available  for  all 

requirements  generated  by  the  input  to  the  STAT  processor  is 

10  600  locations.  Each  subclass  covariance  matrix  requires 

2 

approximately  1/2 (number  of  channels)  locations;  each  subclass 
mean  vector  requires  locations  equal  to  the  number  of  chan- 
nels; and  each  training  field  requires  seven  locations.  If  a 
large  number  of  subclasses,  channels,  and  training  fields  in 
combination  with  one  or  more  of  the  options  available  by  means 
of  the  OPTION  control  cards  causes  the  core  storage  limits  to  be 
exceeded,  the  STAT  processor  prints  a diagnostic  message  request- 
ing the  user  to  decrease  options,  after  which  it  terminates  exe- 
cution (see  section  8.8). 

The  following  formula  determines  the  maximum  number  of  fields 
that  can  be  input  for  a case  without  any  histograms  (eq.  8-4) 
and  another  case  with  subclass  histograms  (eq.  8-5) . 

10  600  - 5NOSPEC  + 7MAXSUB  + l-  - 2MA*St?  *— NOFEAT  + l\ NOFEAT  + 40 
NOFLD  « 55 — 

(8-4 


where 

NOFLD  = number  of  fields 


f*  2 


! 


NOSPEC  = number  of  subclasses  grouped  together  for  spectrogram 
(maximum  of  20) 


MAXSUB  = maximum  number  of  subclasses 
NOFEAT  = number  of  channels 


10  600  - (5NOSPEC  + 7MAXSUB  + 


M0FU0 


(a— 


2 MAXSUB  + 5, 


NOFEAT  + 1) NOFEAT  ♦ 40  + XSIZ 


32 


(8-5) 


If  fields  and  subclasses  need  to  be  histogrammed,  a value  for 
XSIZ (NOHIST)  + 1 should  be  added  to  the  numerator  of  equa- 
tion (8-5) , where 


XSIZ  = range  of  histogram  (maximum  of  101) 
NOHIST  = number  of  channels  histogrammed 


8 . 8 DIAGNOSTIC  MESSAGES 


The  diagnostic  messages  provided  by  the  STAT  processor  are 
listed,  along  with  probable  cause  and  remedy  of  the  condition 
which  prompted  the  message.  During  statistical  computations, 
other  messages  also  may  be  output  by  utility  routines  common 
to  both  STAT  and  other  processors  in  the  EOD-LARSYS.  See  the 
system-related  messages  in  section  6 for  additional  messages 
obtained  from  a STAT  execution. 


Message 


/////  FROM  SUBR.  SETUP1  BAD 

CONTROL  CARD  ENCOUNTERED  


INPUT  CARD  IS 
• • • CCC 1 


'CCCC 


Explanation 

The  input  card  which  was  read 
has  none  of  the  legitimate 
keywords  to  identify  it  as  a 
recognizable  control  card. 

The  card  which  caused  the 
message  is  printed  out  as 
part  of  the  message.  Although 
the  processor  will  continue  to 
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Message 


Explanation 

read  more  control  cards , this 
is  an  indication  of  an  error 
in  the  deck  setup.  The  deck 
should  be  checked  for  proper 
control  cards  and  proper 
sequence  of  cards. 


CHECK  CHANNELS  OR  CLASS  NOS. 
REQUESTED  — CANNOT  BE  LESS  THAN 
OR  EQUAL  ZERO,  OR  GREATER  THAN 
30 

*****  TERMINATING  PROGRAM  EXE- 
CUTION FROM  SUBR.  SETUP1  ***** 


/////FROM  SUB.  SETUP1  

DECREASE  OPTIONS  *****  TER- 
MINATING PROGRAM  EXECUTION  FROM 
SUBR.  SETUP1  ***** 


1 


If  the  channel  numbers  speci- 
fied on  a HISTO  or  CHANNELS 
control  card  are  not  integers 
within  the  range  1 through  30, 
this  message  results.  The 
processor  terminates  execu- 
tion after  printing  this 
message.  Check  the  format 
of  the  applicable  processor 
control  cards  (see  section  3.1 
and  table  8-1) . 

The  STAT  processor  has  run  out 
of  internal  storage  to  handle 
the  combination  of  the  quanti- 
ties of  input  training  fields, 
subclasses,  and  channels. 
Internal  storage  is  fixed  at 
10  600  locations.  Each  sub- 
class requires  roughly 

2 

1/2 (number  of  channels)  loca- 
tions for  the  subclass  sta- 
tistics. If  histograms  or 
spectral  plots  of  subclasses 
and/or  fields  are  requested, 
additional  internal  storage  is 
required.  The  options  speci- 
fied in  the  run  deck  (i.e.. 
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Message 


***  STAT/SETUP1  — ERROR  IN 
OPTION  (?)  REQUESTED  - SCAN  OF 
OPTI  >"£  IS)  DISCONTINUED  AT 
CARD  COLUMN  XX  *** 


***MAXSUB=XX  MAX.  NO.  OF 

SUBCLASSES  CANNOT  BE  GREATER 
THAN  YY 

MAXSUB  SET=YY  PROCEEDING  TO  NEXT 
OPTION (S)  *** 


Explanation 

histograms  and  spectral  plots) 
and  possibly  the  quantities 
of  subclasses,  channels,  and 
training  fields  must  be 
decreased  or  eliminated  in 
order  to  get  a successful  run 
within  the  core  storage 
limitation. 

An  OPTION  control  card  is  not 
acceptable  to  the  processor. 
The  scan  of  the  options  will 
be  discontinued  by  the  proc- 
essor, and  any  options  speci- 
fied beyond  the  erroneous  one 
will  not  be  activated  for  the 
run.  The  processor  continues 
with  reading  of  the  next  con- 
trol card.  (See  section  3.1.2 
and  table  8-1  for  correct 
OPTION  control  card  usage.) 

The  maximum  subclass  number 
input  on  the  OPTION  MAXSUB 
control  card  exceeds  the  maxi- 
mum number  of  subclasses  that 
can  be  handled  by  the  EOD- 
LARSYS.  The  processor  will 
set  the  maximum  number  of 
subclasses,  which  will  apply 
to  subclasses  read  in  from 
the  input  subclass/field 
definition  deck. 


Message 

*****  STAT/LEARNN— MAX.  OF  XX 
SUBCLASSES  EXCEEDED  — FIRST  XX 
SUBCLASSES  USED  — REMAINDER 
IGNORED 


*****  STAT/LEARNN  — MAX.  OF  XX 

FIELDS  EXCEEDED  XX  FIELDS 

RETAINED  FOR  YY  SUBCLASSES 

*****  REMAINDER  OF  INPUT  TRAIN- 
ING FIELDS  NOT  USED 


Explanation 

The  processor  has  read  the 
maximum  allowable  number  or 
subclass  names  and  training 
fields  to  be  associated  with 
each  subclass,  and  the  next 
subclass  name  encountered  in 
the  training  field/subclass 
definition  deck  caused  this 
diagnostic  message.  The  first 
MAXSUB  subclasses  and  associated 
training  fields  input  are  com- 
puted and  the  remainder  are 
ignored  by  the  processor. 

The  STAT  processor  has  read  the 
maximum  number  of  subclass 
names  and  associated  training 
fields  from  the  input  training 
field/subclass  definition 
deck.  The  available  internal 
storage  has  been  filled,  and 
no  further  training  fields  can 
be  accepted.  Training  sta- 
tistics will  be  computed  for 
the  subclasses  and  fields 
which  have  been  read  to  this 
point,  and  the  remainder  are 
ignored  by  the  processor. 
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TABLE  8-1.-  STAT  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 

CHANNELS 


OPTION 


OPTION 


Parameter  and 
default  values 
(b) 

NrN2,N3,...,Nk 

l<k<30 

Default:  k=30;  unless 

the  MSS  data  tape 
(DATAPE)  has  exactly 
30  channels,  the 
default  should  not  be 
taken. 

PUNCH 

Default:  The  module 

STAT  deck  is  not 
punched,  in  which  case 
statistics  are  output 
on  the  SAVTAP  file 
only. 


MAXSUB=N 
Default : 


MAXSUB=1 5 


Function 

N's  are  the  integer  channel 
numbers  used  by  the  processor 
in  computing  training  subclass 
and  training  field  statistics; 
must  be  from  the  set  of  chan- 
nels available  on  the  MSS 
DATAPE  file. 

The  subclass  mean  vector  and 
covariance  matrix  for  every 
subclass  defined  by  user  input 
will  be  punched  on  cards  in  a 
format  acceptable  as  input  to 
other  processors  in  the  system. 
This  punched  card  deck  is  the 
module  STAT  deck  defined  in 
section  3. 1.4.1. 

Informs  the  processor  as  to  the 
maximum  number  of  subclasses 
which  will  be  input.  The  param- 
eter value  is  used  for  dimen- 
sioning purposes  and  reflects 
the  maximum  number  of  available 
computer  storage  locations  being 
utilized  for  other  options 
allowed  by  the  STAT  processor. 
This  parameter  must  be  set  by 


The  keyword  must  be  left  justified  in  card  columns  1 through  10. 

3 1 

The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 
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TABLE  8-1.—  Continued 


Keyword 


OPTION 


OPTION 


OPTION 


Parameter  and 
default  values 


COVAR 

Default:  Statistics 

are  not  printed. 


Function 

the  user  if  the  number  of  sub- 
classes he  is  about  to  define 
will  exceed  the  default.  It  is 
advisable  to  use  this  option 
when  a large  number  of  training 
fields  are  to  be  processed  or 
when  histograms  have  been 
requested. 

The  multichannel  means,  stand- 
ard deviations,  and  covariance 
matrix  (lower  triangular  por- 
tion) are  printed  out  for  each 
training  subclass  and  training 
field  defined  in  the  input 
training  field  definition 
deck. 


COVAR=C 

Default;  Statistics 
are  not  printed. 


The  multichannel  means,  stand- 
ard deviations,  and  covariance 
matrix  (lower  triangular  por- 
tion) are  printed  out  for  each 
training  subclass  defined  in 
the  input  training  field 
definition  deck. 


COVAR=F 

Default:  Statistics 

are  not  printed. 


The  multichannel  means,  stand- 
ard deviations,  and  covariance 
matrix  (lower  triangular  por- 
tion) are  printed  out  for  each 
training  field  defined  in  the 
input  training  field  definition 
deck. 


TABLE  8-1.—  Continued 


Keyword 

OPTION 

HISTO 


OPTION 


OPTION 


Parameter  and 

default  values  Function 


NOCOVAR  No  training  subclass  or  train- 

ing field  statistics  are 
printed  out. 


N1,N2,N3,***,Nk 

l<k<30 

Default:  k=30 


N ' s are  integers  which  provide 
a list  of  channel  numbers  for 
use  in  the  histogram  options. 
The  channel  numbers  must  be 
from  the  set  designated  on  the 
CHANNELS  control  card.  Note : 
This  control  card  does  not 
initiate  the  histogram  option. 


HIST  A histogram  showing  frequency 

Default:  No  histograms  distribution  of  pixels  (reso- 

lution elements  or  radiance 
values)  is  printed  out  for 
every  training  field  and  every 
training  subclass  defined  in 
the  input  training  field 
definition  deck.  For  each 
subclass  (or  field) , a histo- 
gram is  provided  for  every 
channel  designated  on  the 
HISTO  control  card. 

HIST=C  A histogram  printout  is  pro- 

Default:  No  histograms  vided  for  every  training  sub- 

class defined  in  the  input 
training  field  definition  card. 
For  each  subclass,  a histogram 
is  provided  for  every  channel 


TABLE  8-1.—  Continued 


Keyword 


OPTION 


SIZE 


SIZE 


SIZE 


SIZE 


Parameter  and 

default  values  Function 

designated  on  the  HISTO  con- 
trol card. 


HIST=F  A histogram  printout  is  pro- 

Default:  No  histograms  vided  for  every  training  field 

defined  in  the  input  training 
field  definition  deck. 


XHIGH=K 

0<K<255 

Default:  XHIGH=220 


XLOW=L 

0<L<XHIGH 

Default:  XLOW=120 


YSIZ=J 
0<J<f (x) 
Default: 


max 

YSIZ=14 


XSIZ=K 

Default:  XHIGH-XLOW 


K is  an  integer  which  sets  the 

maximum  radiance  value  which 

will  be  histogrammed.  XHIGH 

becomes  X of  the  X-axis 

max 

of  the  histogram  plot. 

L is  an  integer  which  sets  the 

minimum  radiance  value  which 

will  be  histogrammed.  XLOW 

becomes  X . of  the  X-axis  of 
mm 

the  histogram  plot. 

J is  an  integer  which  sets  the 
number  of  increments  on  the 
Y-axis  of  the  histogram  plot; 
therefore,  it  is  the  height 
(number  of  print  lines)  of  the 
Y-axis.  Using  the  input  YSIZ, 
the  processor  will  determine 
the  Y-axis  scale  for  the 
histogram  plot  to  be 
f (x)m  ttYSfZ-D/YSiZ. 

Sets  the  range  which  will  be 
histogrammed;  maximum  range 
is  101. 


TABLE  8-1.—  Continued. 


Keyvord 

SPECTRAL 


OPTION 


Parameter  and 

default  values  Function 


M1'M2'M3'M4 

l<Mi<30 

Default:  4 subclasses 

per  spectral  plot; 
subclasses  1,  2,  3,  and 
4 on  the  first  plot;  5, 
6,  7,  and  8 on  the  sec- 
ond plot;  etc. 


M’s  are  integers  which  provide 
a list  of  from  one  to  four 
subclass  numbers  for  the  sub- 
classes which  are  to  be  plotted 
on  one  single  composite  spec- 
tral plot.  The  subclass  num- 
bers must  be  obtained  from  the 
set  of  subclasses  defined  in 
the  input  training  field 
definition  deck.  Subclass  1 
is  the  first  subclass  defined 
in  the  deck,  and  subsequent 
subclass  numbers  are  obtained 
by  sequentially  numbering  the 
subclasses  as  they  occur  in 
the  training  field  definition 
deck. 


SPECTRAL  A spectral  plot  is  printed 

Default:  Spectral  out  for  every  training  sub- 
plots for  subclasses  class  and  training  field 

defined  in  the  input  training 
field  definition  deck.  The 
plot  consists  of  the  subclass 
(or  field)  mean  radiance  value, 
mean  standard  deviation  (a) , 
and  mean  -a  plotted  versus  the 
channel  (spectral  band)  for 
every  channel  designated  on 
the  CHANNELS  control  card. 


Keyword 

OPTION 


OPTION 


SIZE 


DATAFILE 


TABLE  8-1.—  Continued. 


Parameter  and 

default  values  Function 


SPECTRAL=C 

Default:  Spectral 

plots  for  subclasses 


SPECTRAL=F 
Default:  Spectral 

plots  for  subclasses 


SPECBAS=I 

G<I<105 

Default:  SPECBAS^75 


UNIT=N,FILE=M 
Default:  N=3,M=1 


A spectral  plot  will  be 
printed  out  for  every  sub- 
class defined  in  the  input 
training  field  definition 
deck. 

A spectral  plot  will  be 
printed  out  for  every  field 
defined  in  the  input  training 
field  definition  deck. 

I is  an  integer  which  sets  the 
minimum  radiance  value  on  the 
Y-axis  of  the  spectral  plot 
(i.e. , ym^n) • The  processor 
has  a fixed  Y-axis  increment 
(3)  and  a fixed  number  of 
Y-axis  values  (50) . Using 
SPECBAS , the  processor  deter- 
mines the  Y-axis  range  to  be: 

Ym,„=SPECBAS,  Y =SPECBAS+150 . 
min  max 

N is  the  Fortran  logical  unit 

number  to  which  the  MSS  data 

tape  (DATAPE)  has  been 

assigned;  M is  the  file  number 

on  the  tape  to  be  processed. 

For  back-to-back  executions 

of  several  processors  if  the 

same  file  number  is  used, 

only  one  DATAFILE  control 

card  need  be  input. 


TABLE  8-1.—  Concluded 


Keyword 

STATFILE 

HED1 

HED2 

DATE 

COMMENT 

*END* 

$END* 


Parameter  and 
default  values 

UNIT=N,FILE=M 
Default:  N=1 ,M=1 


Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON, 

TEXAS 

Any  12  characters 
beginning  in  column  11 
Default:  Current  date 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 

Blank 

Blank 


Function 

N is  the  logical  Fortran  unit 
number  to  which  the  SAVTAP 
file  has  been  assigned;  M is 
the  file  number  on  the  tape 
to  be  processed. 

Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 


Prints  the  indicated  12  char- 
acters in  the  right  corner  of 
the  heading  in  place  of  the 
current  date. 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 

Signals  the  end  of  the  control 
cards. 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 
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Figure  8-1.—  Functional  flow  chart  for  the 
STAT  processor. 
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Figure  8-3.-  Continued 
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Figure  8-4.-  Sample  program  listing  for  the  STAT  processor  using  defaults. 


9.  ITERATIVE  SELF-ORGANIZING  CLUSTERING  SYSTEM 
PROCESSOR  - ISOCLS 

A data  set  to  be  clustered  by  ISOCLS  is  defined  by  a class  con- 
sisting of  one  or  more  fields  from  the  MSS  data  tape  (DATAPE) . 

Any  number  of  classes  may  be  defined  and  clustered  as  individual 
data  sets  with  one  entry  into  the  ISOCLS  processor.  The  user  has 
control  over  the  maximum  number  of  clusters  allowed  per  class  via 
the  CLUSTERS  control  card.  However,  the  procedure  may  find  fewer 
clusters  than  the  maximum  allowed.  If  the  user  plans  to  use  the 
statistics  generated  from  the  clusters  in  later  CLASSIFY  or  SELECT 
runs,  he  or  she  must  exercise  control  over  the  maximum  number  of 
clusters.  The  SAVTAP  file  may  contain  statistics  for  up  to 
75  clusters  (or  subclasses) , but  only  60  can  be  used  for  proc- 
essing at  any  one  time  in  CLASSIFY  or  SELECT.  (Control  cards 
are  set  out  in  table  9-1.) 

The  clustering  procedure  used  in  ISOCLS  (ref.  3)  is  an  iterative 
procedure  which  assigns  each  MSS  data  sample  to  a specific 
cluster  by  determining  the  nearest  (in  absolute  distance)  cluster 
center  and  assigning  the  sample  to  it.  At  the  end  of  each 
iteration  (i.e.,  when  all  samples  have  been  assigned  to- a 
cluster) , new  cluster  centers  are  defined  by  computing  the 
mean  vector  for  the  data  samples  actually  assigned  to  the 
cluster.  After  the  initial  split  sequence,  the  iterative  pro- 
cedure terminates  when  the  user-specified  sequence  of  splits  and 
combinations  is  exhausted.  See  the  SEQUEN  control  card.  The 
criteria  for  splitting  or  combining  a cluster  are  user-specified 
by  the  STDMAX  and  DLMIN  control  cards. 

After  the  final  iteration,  the  covariance  matrix  for  each  cluster 
is  computed  and,  at  the  user's  option,  is  printed.  All  cluster 
statistics  for  the  class  are  saved  on  a scratch  file  until  all 


classes  have  been  clustered,  at  which  time  the  SAVTAP  file  is 
written.  The  chaining  of  clusters  for  the  final  map  printout 
is  performed,  if  the  user  has  requested  the  option  (see  CHAIN 
control  card,  table  9-1).  Statistics  for  the  chained  clusters 
are  not  computed. 

The  processor  allows  the  user  to  control  the  amount  of  line- 
printer  output  he  receives  via  the  KRN  and  MAP  control  cards. 

A final  map  of  the  clustered  data  is  always  output  along  with  a 
statistical  summary  of  the  clusters,  which  includes  mean  and 
standard  deviation  vectors,  total  points  assigned  to  each  cluster, 
and  intercluster  distances. 

Optionally  the  user  may  (1)  input  initial  cluster  centers  to 
hasten  the  clustering  process  or  (2)  allow  the  program  to  ini- 
tialize the  process  by  assigning  all  the  data  to  one  cluster, 
obtaining  the  mean  and  standard  deviation,  and  then  splitting. 
Initial  means  may  be  input  (1)  by  cards  (see  control  card  MEANS 
and  Cluster  Means  Deck,  section  3. 1.4. 3)  or  (2)  by  the  SAVTAP 
file  (see  control  card  STATFILE) . Input  of  the  initial  means 
causes  a scratch  file  to  be  written  so  that  the  means  can  be 
used  repeatedly.  Successive  classes  may  or  may  not  use  the 
same  means  to  initialize  cluster  centers  for  a new  class.  The 
control  card  MEANS  allows  the  user  to  request  cluster  centers 
from  the  last  class  to  be  read  from  the  scratch  file  and  used 
as  initial  centers  for  a new  class.  Input  of  a new  set  of  ini- 
tial means  will  cause  the  scratch  file  to  be  overwritten  with 
new  cluster  centers. 

See  the  functional  flow  chart  for  ISOCLS  (fig.  9-1) , which  shows 
when  each  function  is  performed. 


The  clustering  algorithm  embodied  in  ISOCLS  is  detailed  step  by 
step  in  the  following  subsections.  This  entire  procedure  is 
repeated  for  each  class  (or  data  set) . 


9.1  PROCEDURES 

9.1.1  NOTATION  DEFINITIONS 


Symbol 

Fortran 

name 

Definition 

CLD.. 

CLD(I,J) 

Intercluster  distance  between  clusters  i 
and  j . 

d[Xk,M(l,J 

DIST 

Distance  from  the  data  point  k to  the 
center  of  cluster  i. 

DLMIN 

DLMIN 

Threshold  value  for  combining  clusters. 

ISTOP 

ISTOP 

Maximum  number  of  initial  split 
iterations . 

CHNTHS 

CHNTHS 

Chaining  threshold  value. 

LNCAT 

LNCAT 

INCAT 

Number  of  existing  clusters  at  a given 
time. 

N ( i) 

N (I ) 
DN  (I ) 

Total  number  of  data  points  assigned 
to  cluster  i. 

SEQUEN 

SEQUEN 

User-specified  sequence  of  split  and 
combine  iterations. 

NMIN 

NMIN 

Minimum  number  of  data  points  allowed 
per  cluster  for  both  the  initial  split 
iterations  and  for  one  through  (NOSEQ-1) 
SEQUEN  iterations . 

PMIN 

PMIN 

Minimum  of  (PMIN+NOFEAT)  number  of 
data  points  allowed  per  cluster  for 
the  NOSEQfcT:  SEQUEN  iteration. 

Symbol 

Fortran 

name 

Definition 

NOSEQ 

NOSEQ 

Maximum  number  of  SEQUEN  iterations. 

ISEQ 

ISEQ 

Number  of  SEQUEN  iterations  at  a given 
time. 

NOPEAT 

NOFEAT 

Number  of  coordinates  (channels)  in  a 

data  vector. 

STDM/lX 

STDMAX 

Threshold  for  splitting  clusters. 

* 

X 

C(I,K) 

Data  vector  k,  C (I , K) = (X^,X2 , • • • , 
XNOFEAT) k 

MEANS (J,I) 

Mean  of  the  j th  coordinate  of  the 
ith  cluster. 

AVP ( J , I ) 

Temporary  summing  variable  for  the  cal- 
culation of  the  standard  deviation  of 
the  j th  coordinate  of  the  ith  cluster. 

m!1) 

3 

AMN (J,I) 

Summing  variable  for  computation  of 

new  means.  After  all  data  have  been 
assigned  to  clusters  on  each  iteration, 
AM.N(J,I)  is  the  new  mean  of  the  j th 
coordinate  of  the  itfc  cluster. 

STDEV ( J/ I) 

Standard  deviation  of  the  j th  coordi- 

j 

nate  of  the  ith  cluster. 

9.1.2  INITIALIZING  THRESHOLD  VALUES 

Initialize  threshold  values  for  splitting  clusters  (STDMAX) , 
combining  clusters  (DLMIN) , and  deleting  clusters  (NMIN  and 
PM IN) . Then  begin  the  following  iterative  procedure. 


9.1.3  ITERATIVE  PROCEDURE 


9.1. 3.1  Classify  and  Calculate  New  Statistics 

Assign  each  data  point  to  a cluster  and  at  the  same  time  collect 
the  means,  standard  deviations,  and  point  counts  of  the  newly 
developing  clusters.  If  zero  clusters,  set  i = 1 and  go  to 
iteration  b.  If  more  than  zero  clusters,  go  to  iteration  a. 


a.  Assign  the  data  point  X^  - ^X1'X2' * * * 'XNOFEAT^k  to  t*ie 

xth  cluster  if  d£xk,y^J  < d^X^y^J  for  all  j ^ i,  where 

d[xk,u(i>] 


is  defined  as 


b. 


c. 


d. 


e . 


NOFEAT 


d[Xk,y(i)]  = ^ 


X - u(i) 
Xjk 


j = l 

N (i)  = N (i)  + 1 

„(i)  _ N (i)  - lw(i)  1 

Mj NTTr~Mj  + NlTTXjk 

(i)  _ N (i)  - l.,(i)  __l_x2 

Yj nTH  Yj  + N(i)  Ajk 


•r  • I-;1'  • [■!u] 


2/1/2 


j = 1, NOFEAT 


(9-1) 

(9-2) 

(9-3) 

(9-4.) 

(9-5) 


Return  to  step  a and  repeat  iterations  a through  e until  all 
data  points  have  been  classified. 


9 . 1 . 3 . 2 Delete  Clusters 

For  the  initial  split  iterations  and  one  through  (NOSEQ-1) 
user-specified  SEQUEN  iterations,  delete  all  clusters  which  have 
fewer  than  NMIN  members.  For  the  NOSEQtft  user-specified  itera- 
tion (last  user-input  sequence) , delete  all  clusters  which  have 
fewer  than  PMIN  members.  A cluster  is  deleted  simply  by  removing 
the  statistics  for  that  cluster  and  reducing  the  number  of  clus- 
ters (specified  by  LNCAT ) accordingly. 


ff? 


9.1.3. 3 Test  for  Completion 

If  this  is  not  the  last  iteration,  proceed  to  9. 1.3. 4.  If  this 
is  the  last  iteration  and  no  clusters  were  deleted,  the  procedure 
is  finished.  If  one  or  more  clusters  were  deleted  for  having 
less  than  PMIN  members,  go  back  to  9. 1.3.1  and  reassign  the  data 
to  the  clusters  obtained  from  iteration  (NOSEQ-1) . 

9. 1.3. 4 Determine  Type  of  Iteration 

Determine  whether  this  is  to  be  a split  iteration  or  a combine 
iteration  and  proceed  to  the  appropriate  step. 

The  sequence  of  iterations  will  be  as  follows: 

SSSS  CCSCSC 

ISTOP  SEQUEN 
and/or 
PERCENT 


where 

S = split  iteration 
C = combine  iteration 

The  beginning  sequence  of  split  iterations  is  terminated  either 
(1)  when  the  standard  deviations  for  the  user-input  percentage 
of  clusters  (see  PERCENT  control  card,  table  9-1)  are  less  than 
the  STDMAX  threshold  parameter  or  (2)  when  ISTOP  iterations  have 
been  reached.  At  that  point,  the  type  of  iteration  (split  or 
combine)  and  number  of  iterations  (NOSEQ)  are  determined  by  the 
SEQUEN  parameter. 

The  initial  split  iterations  are  for  the  automatic  initializa- 
tion of  cluster  centers  in  the  event  they  are  not  input.  The 
sequence  is  shortened  considerably  if  initial  cluster  centers 
are  input. 


9. 1.3. 5 Split  Clusters 

A cluster  is  split  along  the  j th  coordinate  (1)  if  the  j th 
coordinate  has  the  maximum  standard  deviation  for  the  cluster, 
(2)  if  the  standard  deviation  along  the  jth  coordinate  is 
greater  than  the  STDMAX  threshold  parameter,  and  (3)  if  the 
cluster  has  more  than  2(NMIN+1)  data  points. 


If  conditions  (1)  through  (3)  are  met,  two  new  clusters  are 
created  and  the  parent  cluster  is  deleted.  A cluster  is  created 
merely  by  defining  its  centers  (means)  for  each  coordinate.  If 
the  i th  cluster  is  split  in  the  j th  coordinate,  the  two  new 
clusters  will  have  centers  at  '^2*^  ' * * * • ^ ± a • *’ ' • ^nofeAt]  ' 

where  a will  normally  be  Oj  ^ but  can  b®  a user-input  constant 
(see  SEP  control  card).  On  a given  split  iteration,  if  the 
maximum  number  of  clusters  (CLUSTER)  has  not  been  reached,  all 
clusters  having  a standard  deviation  greater  than  the  STDMAX 
parameter  will  be  split.  To  ensure  that  the  clusters  with  the 
largest  standard  deviations  receive  the  highest  priority  for 
splitting,  when  2 x LNCAT  > CLUSTER,  the  standard  deviations 
are  ordered  along  the  jifc  coordinate  in  descending  order. 

Return  to  9. 1.3.1  after  splitting  clusters. 


9 . 1 . 3 . 6 Combine  Clusters 

Two  clusters  are  combined  if  the  distance  between  them  is  less 
than  the  DLMIN  threshold  parameter.  The  distance  between 
clusters  i and  j is  calculated  as 


CLDij 


(i 
k__ 
(i 


(9-6) 


If  CLD. . < DLMIN  and  CLD . . = MIN (CLD . . ) for  all  i = 1, LNCAT  and 
j = 1, LNCAT  for  all  i j-  j,  clusters  i and  j will  be  merged  to  | 
form  a new  cluster  L with  means 


; k = 1, NOFEAT 


(9-7) 


(L)  _ N(i)p^i)  + N(j)^j) 

Wk  N (i)  + N ( j ) 

The  clusters  i and  j are  deleted.  The  new  cluster  L is  not 
considered  as  a candidate  for  merging  with  any  other  cluster 
on  the  iteration  in  which  it  was  formed.  Return  to  9. 1.3.1 
after  combining  clusters. 

9.1.4  CHAINING 

A final  optional  step  in  the  clustering  procedure  groups  all 
clusters  which  have  intercluster  distances  less  than  the  chain- 
ing threshold  (CHNTHS)  to  form  one  cluster.  The  chaining  pro- 
cedure was  adopted  because  the  minimum  variance  criterion  used 
in  the  iterative  procedure  above  tends  to  group  the  data  into 
spherical  (or  ellipsoidal)  groupings  with  Gaussian  distributions. 
This  type  of  grouping  is  certainly  a natural  grouping  and  would 
quite  often  be  completely  satisfactory. 

Some  natural  groupings  of  the  data  are  odd  shaped  and  cannot  be 
approximated  by  Gaussian  distributions.  Two  examples  are  given 
in  figure  9-2.  At  the  end  of  the  sequence  of  split  and  combine 
iterations,  groupings  of  the  type  in  figure  9-2  are  likely  to  be 
separated  into  subclusters  as  illustrated  in  figure  9-3.  The 
chaining  algorithm  will  indicate  that  the  subclusters  1,  2,  and 
3 (fig.  9-3)  could  be  grouped  into  one  composite  cluster; 
likewise,  subclusters  4,  5,  6,  and  7 could  be  grouped  together 
to  form  one  cluster. 

The  algorithm  scans  the  intercluster  distance  table  (CLD)  and 
begins  a chain  with  the  first  appearance  of  two  clusters  within 
a distance  of  CHNTHS  units.  Once  a subcluster  is  in  the  chain, 
all  clusters  which  are  within  CHNTHS  units  of  the  subcluster 
are  added  to  the  chain.  See  figure  9-4. 


k 


f* 


* 


The  statistics  (means,  standard  deviations,  and  covariance 
matrices)  of  the  clusters  resulting  from  chaining  are  not  cal- 
culated by  the  program  because,  in  many  cases,  the  chained 
cluster  cannot  be  represented  by  a Gaussian  distribution. 

There  are,  of  course,  instances  where  one  can  safely  combine 
those  subclusters  that  are  chained  by  the  program  into  one  com- 
posite (Gaussian)  cluster.  For  example,  subclusters  1,  7 and 
3 in  figure  9-5  can  safely  be  combined  into  one  final  cluster. 
This  is  indicated  by  the  fact  that,  pairwise,  these  three  sub- 
clusters are  all  close  to  one  another.  In  this  case,  the  fol- 
lowing formulas  (ref.  3)  can  be  used  iteratively  to  compute  the 
composite  statistics. 


Assuming  that  two  clusters  (n^,m^,C^)  and  (n2 ,0.^)  are  to  be 
considered  as  one  cluster  (n,m,C) , where  all  n,  m,  and  C are 
the  number  of  points,  the  mean  vectors,  and  the  covariance 
matrices,  respectively,  and  m is  the  transpose  of  m then 


9.2  INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input  to  the  ISOCLS  processor. 
The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may 
assign  any  available  logical  unit.  (See  table  4-1  for  file 


/S3 


assignments  and  section  3.2,  MSS  Image  Data  Tapes,  for  further 
information  on  format.) 

9.3  OUTPUT  FILES 

Statistics  are  output  by  ISOCLS  to  the  SAVTAP  file  (section  4.1). 
The  logical  unit  must  be  assigned  either  to  tape  or  to  Fastrand 
for  this  file.  The  file  should  be  output  to  tape  and  saved  if 
the  statistics  may  be  required  for  further  analysis  in  later  runs. 
Multiple  files  may  be  saved  on  the  same  tape.  The  tape  assign- 
ment defaults  to  logical  unit  A;  but,  by  input  of  the  STATFILE 
control  card,  the  user  may  assign  any  available  logical  unit. 

(See  table  4-1  for  file  assignments  and  STATFILE  control  card, 
table  9-1,  for  further  information.) 

A cluster  map  tape  (MAPFIL)  may  be  generated  optionally  for  dis- 
playing the  results  of  the  clustering  on  the  Bendix  100  or  the 
PMIS  DAS.  The  FORMAT  control  card  initiates  the  option  and 
names  the  desired  format  of  the  tape.  Logical  unit  N (Fortran 
unit  16)  should  be  assigned  to  a nine- track  tape  drive  when 
this  option  is  exercised  (see  section  5.1). 

9.4  SCRATCH  FILES 

The  program  dynamically  assigns  random  access  drum  storage  for 
scratch  files.  ISOCLS  uses  the  drum  storage  for  temporary 
storage  of  cluster  statistics,  the  data  to  be  clustered,  and 
the  classification  of  each  pixel. 


9 . 5 CARD  INPUT 


9.5.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$ISOCLS 

This  card  directs  the  monitor  routine  to  call  the  ISOCLS  proc- 
essor and  causes  all  routines  used  by  the  ISOCLS  processor  to 
be  loaded  into  the  system. 

9.5.2  SYSTEM  CARD  DECKS 

The  processor  will  read  a cluster  MEANS  deck  in  the  format 
defined  in  section  3.1.4. 3.  The  deck  may  be  used  to  initial- 
ize cluster  centers  for  the  clustering  procedure. 

9.5.3  CONTROL  CARDS 

Control  cards  allow  the  user  to  input  various  options.  They  are 
identified  by  a keyword  that  is  left  justified  in  columns  1 
through  10  of  the  card,  with  parameter  values  or  additional 
keywords  in  columns  11  through  72.  These  control  cards  may  be 
in  any  order,  but  they  must  be  the  first  cards  after  the  proc- 
essor card  $ ISOCLS . Table  9-1  lists  all  available  options, 
along  with  their  parameter  values. 

9.5.4  CLASS  AND  FIELD  DEFINITIONS 

A CLASSNAME  card,  followed  by  at  least  one  field  definition 
card,  must  immediately  follow  the  *END*  control  card.  The  for- 
mats for  these  cards  are  defined  in  section  3.1.3. 


The  pixels  from  all  fields  for  one  class  are  extracted  from  the 
MSS  data  tape  (DATAPE)  and  stored  on  a high-speed  drum  file.  The 
data  from  all  fields  for  one  class  are  clustered  as  one  data  set. 
The  statistics  for  all  clusters  in  that  class  are  saved  on  a 
scratch  file,  and  the  next  class  is  clustered.  When  all  classes 
have  been  clustered,  the  statistics  are  written  on  the  SAVTAP  file 
(see  appendix  K for  sample  execution) . The  SETUP  routine  may 
be  entered  after  each  class  to  change  parameter  values.  The 
input  for  definition  of  classes  and  fields  is  explained  below. 


Write  the  SAVTAP  file 
after  five  classes  have 
been  clustered. 


WHEAT  1 clustered  as 
one  data  set. 


Enter  SETUP  routine 
again  for  new  parameter 
values. 


) $ISOCLS 
[ — ► CLASSES  5 

1 | . 

| ; (Other  control  cards) 

♦END* 

CLASSNAME  WHEAT 1 

FIELD1 

1 FIELD 2 

' FIELD3 
1 CLASSNAME  WHEAT 2 

2 FIELD4  I 

) ( FIELD5  ♦ 

— •-  $END* 

’ ( 

i • (New  parameter  values) 


Now  write  statistics 
file. 


♦END* 

/ CLASSNAME 
| FIELD6 
IFIELD7 
(FIELDS 
( CLASSNAME 
\ FIELD9 


CLASSNAME 

FIELD10 

$END* 

$ (Next  processor) 


NONWH1 


N0NWH2 

♦ 


NONWH3 


♦ 


9.5.5  DECK  SETUP 


The  deck  setup  for  the  ISOCLS  processor  is  given  in  figure  9-6; 
ISOCLS  may  be  run  independently,  as  shown  in  (a) , or  back  to 
back  with  CLASSIFY  and  DISPLAY,  as  shown  in  (b) . The  @ in 
column  1 indicates  the  master  punch  for  the  Univac  system, 
which  is  the  7-8  multipunch. 

9.6  CARD  OUTPUT 

A module  STAT  deck  (see  section  3. 1.4.1)  may  be  punched  and  used 
as  an  interface  between  ISOCLS  and  SELECT  or  CLASSIFY.  This 
option  is  exercised  via  the  OPTION  PUNCH  control  card. 

9.7  SAMPLE  COMPUTER  RUNS 

The  two  computer  listings  in  figures  9-7  and  9-8  demonstrate, 
respectively,  the  use  of  all  defaults  and  the  use  of  all  options 
for  the  ISOCLS  processor. 

Run  1 (fig.  9-7)  uses  all  defaults  except  CHANNELS  and  clusters 
a large  data  set  from  flight  line  C-l.  The  data  set  is  given 
the  class  name  ALL,  and  it  will  have  a maximum  of  60  clusters. 
The  SAVTAP  file  will  have  only  one  class.  Therefore,  if  it  is 
used  in  CLASSIFY,  categories  cannot  be  defined  inasmuch  as  the 
standard  classifier  would  be  used.  Output  from  run  1 is  not 
shown. 

Run  2 (fig.  9-8)  demonstrates  the  use  of  all  options  in  ISOCLS. 
Training  fields  for  two  classes  are  clustered,  producing 
20  clusters  for  class  WHEAT  and  20  clusters  for  class  NONWH. 

Run  3 (fig.  9-9)  demonstrates  the  use  of  most  options  in  ISOCLS. 
A LAC IE  segment  is  clustered  and  the  image  output  to  a file  in 
Universal  format. 
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9.8  RESTRICTIONS 

The  ISOCLS  processor  uses  the  FH432  and  FH1782  drums  for  tem- 
porary scratch  files.  There  are  approximately  1 310  000  words 
of  storage  available  on  these  drums.  The  data  to  be  clustered 
for  one  class  are  stored  on  this  file,  along  with  other  informa- 
tion. To  compute  the  maximum  number  of  pixels  per  class,  use 
the  following  formula. 


Maximum 

pixels 


1 310  000  - 30 


number  of  channels  + 1 


(9-9) 

The  maximum  number  of  clusters  per  class  is  60,  and  the  maximum 
number  of  channels  is  30.  The  covariance  matrices  for  all 
clusters  in  one  class  must  be  stored  in  core  at  one  time.  They 
are  stored  in  an  array  dimensioned  11  500.  The  following  formula 
may  be  used  to  see  if  enough  storage  is  available  for  the 
covariances. 


11  500  > 


number  of  [number  °f fennel’s* 


~ clusters  ^channels 


v 


'-4 


(9-10) 


% ✓ 


[number  of  f 

/number  of' 

2 

\ , ./number  of  , 

- 1800 | 

H 

\ classes  l 

\ channels  , 

) + 3(  channels  * 2)J 

i 

9.9  DIAGNOSTIC  MESSAGES 
Message 

CHANNELS  CANNOT  BE  CHANGED  UNTIL 
THIS  EXECUTION  OF  ISOCLS  IS 
COMPLETED. 


Explanation 

The  channels  to  be  used  should 
be  set  in  the  first  set  of 
control  cards  input  after  the 
ISOCLS  card.  That  set  of 
channels  will  be  used  for  all 
classes.  If  the  user  attempts 
to  input  a CHANNELS  card  into 
the  SETUP  routine  on  a later 
entry,  the  card  will  be 
ignored. 


4 


t 

♦ 


Message 

NO.  OF  CLASSES  CANNOT  BE  CHANGED 
UNTIL  THIS  EXECUTION  OF  ISOCLS 
IS  COMPLETED. 


END-OF-TAPE  REACHED  BEFORE 
END-OF-FIELD. 

INPUT  ERROR  - A CLASSNAME  CARD 
MUST  BE  INPUT  BEFORE  A GROUP  OF 
FIELDS. 

NO.  OF  PIXELS  TO  BE  UNPACKED  PER 
SCAN  EXCEEDS  THE  DIMENSION  LIMIT 
OF 

■i  ■ 1 i ■ • 

TOO  MUCH  DATA  REQUESTED  — 

PIXELS  * (CHANNELS  + 1)  CANNOT 
EXCEED 

■ ,■■■!.  . • 

STORAGE  REQUIRED  FOR  FIELD  DEFI- 
NITION INFORMATION  EXCEEDS  THE 
DIMENSION  LIMIT  OF 


DIMENSION  LIMITS  EXCEEDED  IN 

ISOCLS  BY  . REDUCE 

CHANNELS  OR  MAX.  CLUSTERS. 

DIMENSION  LIMIT  OF  FOR 

COVARIANCES  EXCEEDED. 


Explanation 

The  number  of  classes  to  be 
clustered  must  be  input  only 
in  the  first  set  of  control 
cards  input  after  the  ISOCLS 
card.  If  the  user  attempts 
to  change  this  parameter,  the 
input  will  be  ignored. 

A field  has  been  defined 
beyond  the  limits  of  the  MSS 
DATAPE. 

See  section  9.5.4  on  defining 
classes  and  fields. 

Decrease  the  number  of  chan- 
nels or  pixels  per  scan  in 
the  field. 

Drum  file  will  not  hold  all 
of  the  data  for  one  class. 
Reduce  channels  or  size  of 
fields . 

Reduce  the  number  of  fields. 

All  vertices,  names,  and  rec- 
tangular coordinates  are  saved 
for  each  field.  The  user  has 
exceeded  storage . 

The  user  has  exceeded  storage. 
The  number  of  channels  or  maxi- 
mum clusters  per  class  should 
be  reduced. 

Same . 


A? 


Message 

WRITE  ON  UNIT TERMINATED 

ABNORMALLY.  DAS  TAPE  NOT 
CREATED.  ISTAT  * 


END-OF-TAPE  ON  UNIT 
LAST  LINE  WRITTEN. 


INVALID  INPUT  CARD  IGNORED. 


Explanation 

Printed  by  subroutine  DSTAPE. 
User  should  resubmit  the  job 
with  a different  output  tape. 
This  diagnostic  message  indi- 
cates that  either  a bad  type 
was  being  used  or  a tape 
drive  error  occurred. 

Printed  by  DSTAPE.  The  end 
of  reel  has  been  encountered 
while  attempting  to  write 
the  MAPFIL  tape. 

Printed  by  subroutine  SETUP 7 . 
Check  table  9-1  for  correct 
spelling  of  keywords  for  card 
input  and  make  sure  the  key- 
word is  left  justified  in  the 
field. 


TABLE  9-1. - ISOCLS  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


Parameter  and 
default  values 
(b) 


CHANNELS  DATAK^  ,C2  ,C3 , • • • ,Ck, 

STAT-A^  ^2  , * * * # Aj^ 

k<number  of  channels 
on  SAVTAP<30 
Default:  None 


Function 

C's  are  integer  channel  numbers 
that  (1)  will  be  used  in  clus- 
tering and  (2)  refer  to  the  MSS 
data  tape  (DATAPE) . A's  are 
integer  channel  numbers  that 
(1)  will  be  the  starting  vec- 
tors (initial  means) , (2)  refer 
to  the  SAVTAP  file , and  (3)  must 
be  a subset  of  the  channels  on 
the  SAVTAP  file.  The  same 
channels  must  be  used  through- 
out one  execution  of  ISOCLS. 

If  a cluster  MEANS  deck  is  in- 
put, the  channels  on  this  card 
must  be  a subset  of  the 
channels  in  the  MEANS  deck. 


OPTION  ORDER 

Default:  The  color 

keys  will  be  ordered 
according  to  cluster 
numbers . 


The  color  keys  on  the  MAPFIL 
tape  will  be  ordered  according 
to  greenness.  See  section  5.1 
for  further  details  of  color 
keys . 


OPTION  PUNCH=N 

Default:  If  PUNCH  is 

omitted,  no  cards  are 
punched;  if  N is  omit- 
ted, it  defaults  to  1. 


Punches  the  means  and  covariance 
matrix  for  each  cluster  in  the 
module  STAT  deck  format  defined 
in  section  3.1.4. 1.  N~1  punches 

module  STAT  deck;  N=2  punches 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

bThe  parameter  and  default  values  are  in  card  columns  11  through  72 
in  any  column  past  10) . 


(beginning 


- * st*  a?  ? 


TABLE  9-1.-  Continued 


Keyword 

OPTION 

SEQUEN 

OPTION 

SYMBOLS 

FORMAT 

FORMAT 

OPTION 


Parameter  and 

default  values  Function 

ERIPS  interface  deck;  and  N=3 
punches  both  decks . 

STATS  Prints  the  covariance  matrix 

for  each  cluster. 


AA* • *A 
Default:  SC 


ERCOMP 


A represents  the  sequence  of  S 
and  C characters  used  for  iter- 
ation control  after  the  initial 
split  sequence.  A maximum  of 
19  characters  may  be  input. 

Prints  an  error  criterion  for 
each  iteration. 


Si '^2 'S3 ' * * * Symbols  used  to  identify 

Default:  1,2, •••9,  clusters  in  the  printout. 

A,B,***Z,%,#, A/ 

$ » @ / = » 0 1 • 1 1 

, ; , , ' , comma , period , 
blank , 


UNIVERSAL 
Default:  Output 

MAPFIL  tape  is  not 
generated. 

LARSYS 

Default:  Output 

MAPFIL  tape  is  not 
generated. 

CLUSTER 

Default:  If  the 

FORMAT  control  card 


Generates  the  output  cluster 
MAPFIL  tape  in  Universal  for- 
mat (see  section  5.1  for 
further  information) . 

Generates  the  output  cluster 
MAPFIL  tape  in  LARSYS  format. 


The  output  cluster  MAPFIL  tape 
will  contain  the  cluster  number 
to  which  the  corresponding 


TABLE  9-1.—  Continued 


Keyword 


NMIN 


DLMIN 


PMIN 


SEP 


DATAFILE 


Parameter  and 

default  values  Function 


is  input,  the  output 
cluster  MAPFIL  tape 
will  contain  the 
mean  vector  of  the 
cluster  to  which  the 
corresponding  pixel 
was  assigned. 


pixel  was  assigned.  When 
selecting  this  option,  the 
FORMAT  control  card  must  be 
input  also. 


N 


X 

Default:  3.2 


Deletes  any  cluster  with  fewer 
than  N members  on  the  first 
through  next-to-last  iteration 
(see  section  9.1.1). 

On  a combine  iteration,  com- 
bines any  two  clusters  whose 
means  are  closer  than  X units. 


N 


X 

Default:  Maximum  of 

the  channel  standard 
deviations  in  the 
cluster 

UNIT=N , FILE=M 
Default:  N=3,M=1 


Deletes  any  cluster  with  fewer 
than  N members  on  last  itera- 
tion (see  section  9.1.1). 

When  splitting  a cluster, 
separates  the  new  clusters  by 
a distance  of  X units. 


N is  the  Fortran  logical  unit 
number  to  which  the  MSS  data 
tape  (DATAPE)  has  been  assigned 
M is  the  file  number  on  the 
tape  to  be  processed.  For 
back-to-back  executions,  if 


TABLE  9-1.-  Continued. 


Keyword 


STATFILE 


I STOP 


PERCENT 


Parameter  and 

default  values  Function 

the  same  data  file  is  to  be 
processed  throughout  the  execu- 
tion, only  one  DATAFILE  card 
need  be  submitted. 


INPUT/UN IT=N , F ILE=M , 
OUTPUT/UNIT=L,FILE=S 
Default:  No  defaults 

for  INPUT;  L=1 , S=1  for 
OUTPUT 


N is  the  Fortran  logical  unit 
number  to  which  the  SAVTAP 
file  containing  the  initial 
means  has  been  assigned;  M is 
the  file  number  of  the  tape 
to  be  processed;  L is  the 
Fortran  logical  unit  number 
to  which  the  SAVTAP  file  con- 
taining the  generated  statis- 
tics will  be  output;  S is 
the  file  number  on  the  tape 
for  saving  the  clustered 
statistics.  (Input  of  initial 
means  from  SAVTAP  file  is 
illustrated  in  section  16, 
figure  16-4.) 


N 

Default:  10 

N 

Default:  80 


A maximum  of  N iterations  is 
performed  in  the  initial  split 
sequence. 

N,  an  integer  number,  is  the 
test  variable  for  the  percent- 
age of  stabilized  clusters  with 
standard  deviations  less  than 
the  threshold  parameter  STDMAX 
in  the  initial  split  iteration 
sequence . 


Keyword 

STDMAX 

CLASSES 

CLUSTERS 

KRN 

MAP 

CHAIN 

SUBCLASS 


TABLE  9-1.—  Continued. 


Parameter  and 

default  values  Function 


X 

Default:  4.5 


On  a split  iteration,  splits 
any  cluster  whose  maximum 
standard  deviation  is  greater 
than  X units. 


N 

Default:  1 


N 

Default:  60 

N 

Default:  20 


Number  of  classes  to  be  clus- 
tered (see  section  9.5.4  for 
defining  classes) . 

Maximum  number  of  clusters  per 
class;  N must  be  <60. 

Prints  out  a summary  of  the 
clusters  at  every  Nth  iteration. 


N Prints  out  a map  of  the  clus- 

Default:  20  tered  data  along  with  the  sum- 

mary for  every  Nth  iteration. 

A final  cluster  map  is  printed 
regardless  of  this  parameter. 


X Chains  all  clusters  within  X 

Default:  Chaining  not  units  of  each  other  to  form 

performed  one  cluster.  Chaining  of  clus- 

ters affects  only  the  final  map 
printout  and  MAPFIL  tape. 


C1'C2'C3' 

k<60 


fC, 


Default:  All  sub- 

classes/clusters on 


SAVTAP  file  will  be 


used  in  initializing 
the  clustering. 


C’s  are  integer  subclass 
or  cluster  numbers  that 
(1)  will  be  used  in  the  ini- 
tial means,  (2)  refer  to  the 
SAVTAP  file,  and  (3)  must  be 
a subset  of  the  subclasses  or 
clusters  on  the  SAVTAP  file. 


TABLE  9-1.—  Continued 


Keyword 

MODULE 

MEANS 


MEANS 


HED1 

HED2 

DATE 


Parameter  and 
default  values 

Blanks 


CARDS 

Default:  Clustering 

procedure  is  automati- 
cally initialized  if 
this  deck  or  MEANS 
file  is  not  input. 

FILE 

Default:  Cluster  cen- 

ters are  automatically 
initialized  if  this 
card  or  the  MEAN  card 
deck  is  not  input. 


Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON , 

TEXAS 

Any  12  characters 
beginning  in  column  11 
Default:  Current  date 


Function 

Initializes  the  reading  of  the 
module  STAT  deck  that  imme- 
diately follows  this  card. 

Initializes  input  of  the  clus- 
ter MEANS  deck  defined  in 
section  3.1.4. 3.  This  deck 
is  used  to  initialize  cluster 
centers  for  the  clustering 
procedure. 

Indicates  means  for  initial 
clusters  have  been  input  pre- 
viously from  cards  and  stored 
on  file.  The  same  initial 
means  are  to  be  used  again  for 
initializing  the  process  for  a 
new  data  set. 

Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 


Prints  the  indicated  12  charac- 
ters in  the  right  corner  of  the 
heading  in  place  of  the  cur- 
rent date. 


TABLE  9-1.—  Concluded 


Parameter  and 
Keyword  default  values 

COMMENT  Any  60  characters 

beginning  in  column  11 
Default:  No  comments 

printed 

*END*  Blank 

$END*  Blank 


Function 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 

Indicates  the  end  of  control 
cards. 

Indicates  the  end  of  all 
classes  to  be  clustered  for 
this  set  of  control  cards. 

The  SETUP  routine  will  be 
reentered  to  read  new  control 
cards  for  the  next  class 
until  all  classes  have  been 
clustered. 


Figure  9-1.—  Functional  flow  chart  for  the  ISOCLS  processor 


(a)  The  boomerang- shaped 
cluster. 


(b)  The  donut-shaped 
cluster. 


Figure  9-2.—  Odd-shaped  clusters. 


(a)  Subclustering  of  the  (b)  Subclustering  of  the 

boomerang-shaped  cluster.  donut-shaped  cluster. 

Figure  9-3.—  Breaking  up  of  the  clusters  into  subclusters. 


0.0  7.5  6.2  3.2  11. 

7.5  0.0  3.1  5.6  3. 

3 6.2  3.1  0.0  3.1  6.3  CHNTHS 

4 3.2  5.6  3.1  0.0  9.7 

5 11.8  3.0  6.3  9.7  0.0 

(b)  Intercluster  distance  table. 

Figure  9-4.—  Example  of  chaining. 


(Back  of  dtck) 


(■ 


♦END* 


Optional  — see  j / — — — — 

sections  3. 1.4. 3 /(cluster  MEANS  deck) 
and  9.4.3.  U 

fu 


(Control  cards) 


6 


(> 


0- 


( 


6 


$ISOCLS 


@ XQT  LARSAA 


TRW  Z 


IN  Z 


TRW  Z 


@ XQT  CUR 


@N  ASG  N= (MAPFIL  number) 


9 ASG  A= (SAVTAP  number) 


o 


(• 


(' 


0 


) Needed  only  if  FORMAT 
/ card  is  input. 

■j  SAVTAP  file  may  be 
J assigned  to  tape 
) or  Fastrand. 


@ ASG  C= (MSS  DATAPE  number) 


@ ASG  Z= (program  tape  number) 


@N  MSG  TAPES  2 FASTRAND  3 FH432  1 FHL782  1 


@Z  RUN  — 


(Front  of  d«ck) 


(a)  For  independent  execution. 


Figure  9-6.—  Deck  setup  for  the  ISOCLS  processor. 


9^2-9-" 

/73 


(Bock  of  dock) 


^*END* 

Optional  — see  ( / 

sections  3.1. 4.3)/  (Cluster  MEANS  deck) 
and  9.4.3.  U 

( (Control  cards) 


SISOCLS 


0 XQT  LARSAA 


TRW  Z 


IN  Z 


TRW  Z 


0 XQT  CUR 


r 

0N  ASG  N= (MAPFIL  number)  ^Optional 


0 ASG  A,B 


0 ASG  C= (MSS  DATAPE  number) 


0 ASG  Z= (program  tape  number 


0N  MSG  TAPES  3 FASTRAND  2 


0Z  RUN  — 


The  SAVTAP  (A)  and  MAPTAP 
(B)  files  must  be  assigned 
to  either  tape  or  Fastrand 


(Front  of  dock) 


(b)  For  execution  back  to  back  with  CLASSIFY  and  DISPLAY. 


Figure  9-6.—  Continued, 


(Bock  of  dock) 


WINTER 


6Z  *>Ufv  L73179,TF7,H4,  1659,  <36 19 ,C  , 20. 5 
6 A SG  A 

§ ASG  C=V03662 
i ASG  Z=V01366 

* XCT  CUR 
TRW  Z 

IN  Z 
TRW  Z 

§ XCT  LARSAA 

$ 1 sects 

CHANNELS  DATA=1 ,2 , 3 ,4 ,5 ,6 

♦ END* 

CLASSNAME  ALL 

LI  NEC  I (2,2>,(  1,1 >,(220,1  ),( 220, 950), ( 1,950) 

SEND* 

VEXIT 

«E  PMD 


Figure  9-7.—  Sample  program  listing  for  the  ISOCLS  processor 
using  all  defaults  except  CHANNELS. 
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Figure  9-8.—  Sample  program  listing  for  the  ISOCLS  processor 

using  two  sets  of  control  cards . 
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Figure  9-8.—  Concluded. 
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Figure  9-9.—  Sample  program  listing  and  output  for  the  ISOCLS 

processor  using  most  options. 
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Figure  9-9.—  Continued 
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Figure  9-9.-  Continued 


K 


ORIGINAL  PAGE  IS 

OF  POOR  QUALITY 


' * 


■■uhe  * it,  r, . ■ 


Figure  9-9.—  Continued 


1 


A 

* 


A 

-I 

u 

© 


3 

tw 

k9 


IM 

31 


ML 

►O 

3 


03 

Ut 


8 


* 

O 


Cr> 

rx  n 

M • • 
x*  f» 


trt 

Mae 


— *xn— Jt 

xxm~x— 


-*  r*«o»  « 


-90«r« 

*«!——• 


09 

3*» 

• * 


*-l 

KU 


m 

— —A*  ***•*• 

(A 

'4« 

X 

tA 

A|-*9AIO« 

3 

Ote 

few 

o 

3 

-J 

A 

x 

-J 

• WAA"m 

VI 

X AO 

XX 

tel  3 

tei 

xrnoiwA 

r 

-AO 

AA 

u 

X 

< 

•te 

*<Mr» 

VI 

u 

X 

A * • • 

• • 

III 

9 

* 

-1 

zorrte« 

tel 

— -» 

X-v 

telX 

tel 

y 

lA 

9 

— Odf- 

(A 

tel 

3 — 

* 

A# 

X 

-IVAAtAiA 

• 

OX 

A* 

IA 

o 

At»9tefe>«iA 

fe~ 

Ui9 

w 

• 

*- 

• m • • • 

tel 

(A  — 

X 

X 

— 99»  1?  — 

fe» 

t-f M'teAtrte'te 

C 

x 3« 

« f 

(A 

w 

O 

IA 

• 

X 

X 

tAO  A 

XX 

tel  IA 

« 

te. 

m 

O 

w 

99 

u 

tr 

A • * • 

• • 

X X 

-i 

ui 

A 

te 

> 

tei 

A*  A 

AX 

few 

V. 

tel 

VI 

3X 

A 

3 

tA 

X 

»fe*feife*AA 

O 

X 

Xtel 

O 

3 

M 

IA 

— x «n«a 

w4«|f • 

X 

Ote- 

9 

few 

J 

o 

X 

3 

few 

Ute 

X 

X 

U 

X 

_l 

X 

-r-MO 

* 

lA 

OXX 

90 

• « 

tm 

mm 

o 

tel 

X 

X 

mm 

OtA  — 

XX 

few 

x 

* 

te. 

u. 

CN  • X • 

X 

SI 

O 

u 

o 

mm  • • • 

• • 

—tei 

'* 

X. 

3 

o 

c 

X 

•A  — 

9 X 

-IX 

w 

► 

X 

te  — 

*A 

y 

X 

X 

•A 

few 

5A  fe- 

3 

tel 

vy 

tA 

te 

J 

< 

■ 

9 

few  tel 

U 

mm 

X 

X 

3> 

3 

3 

3 

X 

X 

te  — 

y 

X 

X 

X 

tel 

ut 

X 

Xfe- 

X 

tel 

tel 

— X 

X 

mJ 

mi 

few  wiNArA 

M 

A 

I- 

tel 

a 

X 

tA 

3 

3 

A-NA 

X'A 

X3 

te- 

»• 

3 

■J 

mi 

3 

Ut 

X 

3 

O 

J 

u 

u 

~! 

IA3 

“ 

*W 

*•> 

VI 

VI 

3 SI 

9^T2 

^/Sfc 


Figure  9-9.—  Continued 
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Figure  9-9.—  Continued 
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Figure  9-9.-  Continued 
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Figure  9-9.—  Continued 
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Figure  9-9.—  Continued 


Figure  9-9.—  Continued 


Figure  9-9.—  Continued 
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Figure  9-9.—  Continued 


Figure  9-9.-  Continued 


Figure  9-9 Concluded 


10.  FEATURE  SELECTION  PROCESSOR  - SELECT 


The  feature  selection  processor  SELECT  provides  a means  of 
measuring  the  relative  importance  of  the  individual  channels 
and  obtaining  the  set  of  channels  which  provides  the  best  dis- 
crimination between  subclasses.  The  processor  allows  the  user 
to  choose  one  of  the  following  three  criteria  for  measuring  the 
separability  of  the  subclasses  for  a set  of  channels  or  for 
linear  combinations  of  the  channels. 

• Weighted  average  interclass  divergence 

• Weighted  average  transformed  divergence 

• Weighted  average  Bhattacharyya  distance 

Either  the  Exhaustive  Search  or  the  Without  Replacement  Procedure 
can  be  used  with  one  of  the  above  criteria  to  select  a "best" 
set  of  channels.  The  Exhaustive  Search  Procedure  determines 
the  best  set  of  k out  of  n channels  by  computing  the  separa- 
bility measure  for  every  possible  combination  of  k channels. 

This  results  in  n!/k! (n  - k) ! computations  of  the  separability 
measure.  The  computer  time  required  for  this  procedure  is  pro- 
hibitive for  a large  n.  In  such  cases,  the  Without  Replacement 
Procedure  could  be  used. 

The  Without  Replacement  Procedure  determines  the  best  k out  of 
n channels  in  the  following  manner.  First,  the  single  channel 
which  extremizes  the  separability  measure  is  selected.  Each  of 
the  remaining  (n  - 1)  channels  are  paired  with  the  best  single 
channel  in  selecting  the  best  pair  of  channels.  The  best  triplet 
is  determined  by  combining  the  remaining  (n  - 2)  channels  with 
the  best  pair.  The  process  continues  until  the  best  set  of 
k channels  has  been  selected.  The  number  of  times  the  separa- 
bility measure  must  be  computed  is  n + (n  - 1)  + (n  + 2)  + ••• 

(n  - k + 1) . 


/?S- 


A third  procedure,  the  Davidon-Fletcher-Powell  Procedure,  is  a 
powerful  iterative  descent  method  for  finding  a local  minimum  of 
a function  of  several  variables.  The  procedure  is  discussed  in 
reference  6.  How  the  procedure  applies  to  the  problem  of  channel 
selection  or  dimensionality  reduction  is  discussed  in  reference  7. 
In  SELECT,  the  Davidon-Fletcher-Powell  Procedure  computes  a 
k-by-n  linear  transformation  matrix  which  extremizes  a given 
separability  measure.  This  matrix,  referred  to  as  the  B-matrix, 
is  saved  on  the  BMFILE  (section  4.2)  and  optionally  is  punched  on 
cards  (B-Matrix  Deck,  section  3. 1.4. 2)  for  later  input  to  the 
CLASSIFY,  SELECT,  TRSTAT , SCTRPL,  or  DATA-TR  processors. 

An  initial  guess  for  the  B-matrix  must  be  provided  for  the 
Davidon-Fletcher-Powell  routines  and  may  be  input  via  the 
B-matrix  deck  or  BMFILE.  If  the  initial  guess  is  not  provided 
by  the  user,  SELECT  will  execute  the  Without  Replacement  Proce- 
dure first  to  obtain  a best  set  of  channels,  which  it  will  use 
to  initialize  a first-guess  B-matrix  for  the  Davidon-Fletcher- 
Powell  Procedure. 

In  addition  to  selecting  a best  set  of  channels  and/or  linear 
combinations,  the  processor  will  evaluate  any  one  of  the  three 
separability  measures  for  a specified  linear  combination  of  the 
channels.  The  linear  combination  must  be  input  via  the  B-matrix 
deck  or  the  BMFILE  if  SELECT  has  been  executed  previously  in  the 
same  run.  This  option  is  the  fourth  procedure  defined  under  the 
PROCEDURE  control  card. 

The  processor  will  also  evaluate  any  of  the  separability  measures 
for  Specified  sets  of  channels.  This  request  is  made  using  the 
EVALUATE  and  PROCEDURE  control  cards.  This  is  the  fifth  option 
defined  under  the  PROCEDURE  control  card. 

See  the  functional  flow  chart  for  the  SELECT  processor  (fig.  10-1) . 


10.1  INPUT  FILES 


The  SELECT  processor  requires  the  statistics  output  from  either 
STAT  or  ISOCLS.  Both  STAT  and  ISOCLS  write  the  SAVTAP  file  and 
optionally  punch  the  module  STAT  deck  (see  section  3. 1.4.1  for 
format)  which  may  be  used  as  input  to  SELECT. 

10.2  OUTPUT  FILES 

The  BMFILE  is  output  by  SELECT  when  the  Davidon-Fletcher-Powell 
Procedure  is  used  (see  appendix  H for  sample  execution) . The 
file  is  written  on  logical  unit  H (Fortran  unit  10)  and  must  be 
assigned  to  tape  or  Fastrand. 

The  corresponding  B-matrix  deck  is  punched  if  the  OPTION  PUNCH 
control  card  is  included  in  the  deck  setup. 

10.3  SCRATCH  FILES 

The  random  access  drum  files  are  used  as  scratch  files  in 
SELECT.  No  assignment  is  necessary. 

10.4  CARD  INPUT 

All  system  card  input  formats  referred  to  in  this  section  are 
defined  in  section  3. 

10.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 
For  example, 

$ SELECT 

This  card  directs  the  monitor  routine  to  execute  SELECT  and 
initiates  loading  of  routines  used  by  SELECT. 


Zl'V-d 


10.4.2  SYSTEM  CARD  DECKS 


The  processor  will  read  and  process  the  module  STAT  deck  and  the 
B-matrix  deck. 

10.4.3  CONTROL  CARDS 

Table  10-1  lists  the  control  cards  which  are  recognized  by 
SELECT. 


10.4.4  FIELD  DEFINITIONS 

Field  definitions  do  not  apply  to  the  SELECT  processor. 

10.4.5  DECK  SETUP 

The  deck  setup  for  the  SELECT  processor  is  given  in  figure  10-2. 
The  @ in  column  1 indicates  the  master  punch  for  the  Univac 
system#  which  is  the  7-8  multipunch. 

10.5  CARD  OUTPUT 

SELECT  outputs  the  B-matrix  deck  on  option  (see  control  card 
OPTION  PUNCH) . This  is  optional  output  only  when  the  Davidon- 
Fletcher-Powell  Procedure  is  executed. 

10.6  SAMPLE  COMPUTER  RUNS 

The  two  computer  runs  listed  in  figures  10-3  and  10-4  demonstrate 
the  use  of  all  defaults  and  the  use  of  all  options,  respectively, 
for  the  SELECT  processor. 

Run  1 (fig.  10-3)  demonstrates  the  use  of  all  defaults.  Output 
from  this  run  is  not  shown. 

Run  2 (fig.  10-4)  demonstrates  the  use  of  all  options  with  a 
specific  criterion  and  procedure.  Input  of  a B-matrix  deck  is 
not  shown.  Some  of  the  output  from  this  computer  run  is  given. 


10.7  RESTRICTIONS 


The  system-related  restrictions  in  section  17  apply  to  the 
SELECT  processor. 


Two  large  arrays  are  dimensioned  in  SELECT  and  used  for  the 
variable  dimensioning  of  several  smaller  arrays.  Storage  in 
one  array  is  a function  of  the  number  of  subclasses  and  channels 
requested.  That  is, 
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Storage  requirements  in  the  other  array  are  dependant  on  the 
procedure  and  criterion  being  used.  The  Davidon-Fletcher-Powell 
Procedure  requires  much  more  storage  than  the  other  procedures, 
and  the  weighted  average  interclass  divergence  requires  more 
storage  than  the  other  criteria.  Requirements  for  each  are 
given  below. 


a.  Exhaustive  Search  and  Without  Replacement  Procedures 
• Weighted  average  interclass  divergence: 
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(10-2) 

• Weighted  average  transformed  divergence  and  Bhattacharyya 
distance : 


/Number  of\ / Number  of  , ,\  . , _ ___ 
\ channels  )\  channels  ) ~ 


(10-3) 


b.  Davidon-Fletcher-Powell  Procedure 

• Weighted  average  interclass  divergence: 
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• Weighted  average  transformed  divergence  and  Bhattacharyya 
distance: 

. • , ../Number  of\/ Number  \ . /1rt  c» 

Approximately  14^  channels  )(of  bestj  < 8000  (10-5) 
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10.8  DIAGNOSTIC  MESSAGES 
Message 

CORE  OVERFLOW  IN  SUBRAY  - 
NN  - STORAGE  LOCATIONS  NEEDED 
FOR  THIS  PROBLEM. 


CORE  OVERFLOW  IN  ARRAY  - 

NN*2  - STORAGE  LOCATIONS  NEEDED 

FOR  THIS  PROBLEM. 

TOO  MANY  EVALUATE  REQUESTS  -- 
REMAINDER  IGNORED. 


Explanation 

User  might  reduce  the  number  of 
subclasses  or  channels  or  try 
another  procedure.  The  SUBRAY 
array  is  used  for  temporary 
storage  in  SELECT  only.  (See 
restrictions , section  10.7) 

See  suggestions  for  first  diag- 
nostic message.  The  ARRAY  array 
is  used  throughout  the  system 
for  variably  dimensioned  storage. 

The  buffer  to  hold  EVALUATE 
requests  is  dimensioned  100. 

The  number  of  channels  and  chan- 
nels to  be  evaluated  for  each 
EVALUATE  request  are  stored  in 
this  array. 


K6 


Message 

GROUP  CARD  IN  ERROR  - IGNORED. 

PROGRAM  CANNOT  PROCESS  LESS 
THAN  2 CHANNELS. 

PROGRAM  CANNOT  PROCESS  LESS 
THAN  2 CLASSES. 

INVALID  CONTROL  CARD  - IGNORED. 

REDUCED  COVARIANCE  MATRIX  FOR 
CLASS  N IS  NOT  POSITIVE 
DEFINITE. 

THE  INCLUDE  REQUEST  FOR  CHAN- 
NEL N IS  NOT  A LEGITIMATE 
REQUEST  - IGNORED. 


Explanation 

Check  format  of  GROUP  option. 

At  least  two  channels  must  be 
input. 

At  least  two  classes  must  be 
input. 

Check  spelling  of  keyword. 

The  indicated  covariance  matrix 
cannot  be  inverted. 

The  indicated  channel  to  be 
included  is  not  among  the  input 
channels . 


TABLE  10-1.-  SELECT  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


Parameter  and 
default  values 
(b) 


BEST 


N1'N2/*** 
Default:  None 


SUBCLASSES  c ,C2,««*,Ck 

k< number  of  sub- 
classes on  SAVTAP<60 
Default:  All  sub- 

classes on  the  SAVTAP 
file 


Function 

Finds  the  best  set  of  N^ , 
N2»***  channels  if  proce- 
dure 1 or  2 is  indicated. 

If  procedure  3 is  indicated, 
the  best  N^,  N2  linear  com- 
binations of  the  channels 
are  found.  Ni'N2'***  are 
integers  separated  by  commas. 
A request  can  be  made  for  a 
maximum  of  10  best  in  one 
call  to  SELECT. 

Provides  for  use  of  only 
subclasses  ci'C2'***  sta- 
tistics for  computation  of 
separability  measure;  allows 
the  user  to  select  a subset 
of  the  statistics  on  the 
SAVTAP  file  for  use  in  com- 
puting the  subclass  separa- 
bilities. Ci'C2f***  are 
integers  representing  the 
subclass  number  as  it  occurs 
in  the  SAVTAP  file. 


CHANNELS  Cl'C2'***'Ck 

k<number  of  channels 
on  SAVTAP <30 


Selects  the  best  set  of  chan 
nels  from  those  indicated  on 
this  card.  Must  be  a subset 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

bThe  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 


_ _ . jfv  • 

«&*#*#**•  • 


TABLE  10-1.—  Continued 


Parameter  and 
Keyword  default  values 

Default:  All  channels 

on  the  SAVTAPE  file 


OPTION  STATS 

Default:  No  statis- 

tics printed 

WEIGHTS  C1=XX, (C1,C2)=YY, 

OTHERS=ZZ 

Default:  All  weights 

set  to  1.0  for  crite- 
ria 2 and  3.  For  cri- 
terion 1,  weights  for 
subclass  pair  (i,j)  are 
j=e”D*-3^6 , where  D^j 
is  the  divergence  for 
subclass  pair  (i,j). 


Function 

of  the  channels  for  which 
statistics  are  input  via  the 
SAVTAP  file  or  module  STAT 

deck.  ci'C2/#**  are  inte9ers 
separated  by  commas. 

Prints  a summary  of  the  sta- 
tistics for  the  subclasses 
and  channels  actually  used 
in  SELECT. 

Sets  weights  for  all  subclass 
pairs  of  subclasses  Cl  to  XX, 
then  sets  subclass  pair  (Cl, 
C2)  to  YY;  sets  all  other 
subclass  pairs  to  ZZ.  Sub- 
class names  C1,C2,  etc., 
must  match  a subclass  name 
from  the  module  STAT  deck, 

the  SAVTAP  file,  or  a GROUP 
c 

name. 


cConsider  the  problem  of  selecting  channels  which  best  separate  wheat  from 
nonwheat  classes,  where  wheat  is  divided  into  subclasses  Wl,  W2,  and  W3, 
and  nonwheat  is  divided  into  the  subclasses  NW1,  NW2,  NW3,  and  NW4.  It  is 
desirable  to  set  all  weights  between  subclasses  in  each  class  to  zero, 
whereas  wheat/nonwheat  class  pair  weights  are  set  to  1.  This  can  be 
accomplished  by  the  following  WEIGHTS  control  cards:  Wl=l. ,W2=1. ,W3~1. , 

(Wl ,W2) =0 . ; and  (W1,W3)=0. , (W2,W3)=0. ,OTHERS=0.  Wl=l.  will  set  weights 
for  the  following  subclass  pairs  equal  to  1:  (W1,NW1) , (W1,NW2) , (W1,NW3) , 

(Wl ,NW4) , (W1,W2) , (W1,W3) . W2=l.  will  set  the  weights  for  the  following 

subclass  pairs  equal  to  1:  (W2,NW1) , (W2,NW2) , (W2,NW3) , (W2,NW4) , (W2,W3) , 

(W2,W1).  W3=l.  will  set  weights  for  the  following  subclass  pairs  equal 

to  1:  (W3,NW1) , (W3,NW2) , (W3,NW3) , (W3,NW4) , (W3,W1) , (W3,W2) . (W1,W2)=0. 

resets  this  subclass  pair  weight  to  0.  OTHERS=0.  sets  all  other  subclass 
pair  weights  to  0. 


TABLE  10-1.—  Continued 


Keyword 

B-MATRIX 


B -MATRIX 


EVALUATE 


Parameter  and 

default  values  Function 


CARDS 

Default:  None 


FILE 

Default:  None 


P P • • • 

^2 9 2 9 

Default:  None 


Indicates  that  the  B-matrix 
card  deck  immediately  fol- 
lows; results  in  the  evalua- 
tion of  the  separability 
measure  using  the  linear 
combinations  defined  by  the 
B-matrix  if  procedure  4 is 
indicated.  If  procedure  3 
is  indicated,  the  B-matrix 
will  be  used  as  a first 
guess  for  the  Davidon- 
Fletcher-Powell  Procedure. 

Indicates  that  a previous 
execution  of  SELECT  has 
written  the  BMFILE.  Depend- 
ing on  the  PROCEDURE  card, 
the  B-matrix  on  file  will  be 
used  as  an  initial  guess  for 
the  Davidon-Fletcher-Powell 
Procedure  or  in  evaluating 
the  separability  measure. 

Evaluates  the  separability 
measure  indicated  on  the 
CRITERION  card  for  channels 

*“1  'C2  ' * * * ' T^e  set 
channels  to  be  evaluated 

must  be  (1)  a subset  of  the 

channels  on  CHANNEL  card  and 

(2)  must  be  on  one  card. 

Several  sets  of  channels  may 


TABLE  10-1.-  Continued 


Keyword 


MODULE 


GROUP 


PROCEDURE 


Parameter  and 

default  values  Function 


be  input  by  using  more  than 
one  EVALUATE  card. 


Blank 


NAME, I, J, • • • 

Default:  No  grouping; 
individual  subclasses 
are  used. 


Indicates  that  the  module 
STAT  deck  immediately  fol- 
lows. The  SAVTAP  file  will 
be  written  as  this  card  deck 
is  read. 

Groups  the  training  sub- 
classes I,J pools  their 
statistics,  and  assigns  NAME 
as  the  group  name.  NAME  may 
be  any  six  characters.  Inte- 
gers must  correspond 

to  the  subclasses  as  they  occur 
in  the  module  STAT  deck  or  the 
SAVTAP  file. 


N N=l:  The  Exhaustive  Search 

Default:  N=2  Procedure  is  used;  N=2:  The 

Without  Replacement  Procedure 
is  used;  N=3:  The  Davidon- 

Fletcher-Powell  Procedure  is 
used;  N=4;  The  user-input 
B-matrix  is  used  to  evaluate 
the  separability  measure; 
and  N=5:  The  Evaluate  Chan- 

nels Procedure  is  used. 


IJWrr^ 

£ OS' 


TABLE  10-1.—  Continued. 


Keyword 

CRITERION 


INCLUDE 


STATFILE 


Parameter  and 

default  values  Function 


N 

Default:  N=1 


C1'C2' * * 
Default : 


None 


UNIT=N,FILE=M 


The  indicated  criterion  is 
used  to  measure  the  separa- 
bility between  subclasses. 

N=1  for  weighted  average 
divergence;  N=2  for  weighted 
transformed  divergence;  and 
N=3  for  weighted  average 
Bhattacharyya  distance. 

Includes  channels  ci'c2'*** 
in  the  best  set;  meaningful 
only  for  the  Without  Replace- 
ment Procedure.  C1'C2'*** 
must  be  a subset  of  channels 
on  CHANNELS  card. 

N is  the  Fortran  logical  unit 
number  to  which  the  SAVTAP 
file  has  been  assigned;  M is 

(1)  the  file  number  from 
which  the  training  statistics 
are  to  be  retrieved  and 

(2)  , if  the  module  STAT  deck 
is  input,  the  file  number  on 
which  the  statistics  are  to 
be  stored.  If  M^l,  this 
control  card  must  precede  the 
module  STAT  deck  in  the 
control  card  deck  setup. 


TABLE  10-1.—  Concluded. 


Keyword 

HED1 


HED2 


ICOUNT 


DATE 


COMMENT 


♦END* 


$END* 


Parameter  and 
default  values 

Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON , 

TEXAS 

N 

Default:  N=300 


Any  12  characters 
beginning  in  column  11 
Default:  Current  date 

Any  60  characters 
beginning  in  column  11 
Default:  No  comment 

printed 

Blank 

Blank 


Function 

Replaces  first  header  line 
with  the  indicated  char- 
acters in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  charac- 
ters in  the  parameter  field. 

Number  of  iterations  for  the 
Davidon-Fletcher-Powell 
Procedure . 

Prints  the  12  characters  in 
the  right  corner  of  the 
heading  in  place  of  the 
current  date. 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 

Signals  the  end  of  the 
control  cards. 

Signals  the  end  of  all  card 
input  for  the  processing 
function . 


& 


INTERPRET  USER-INPUT 
OPTIONS 

SELECT  SUBSET  OF  STATIS- 
TICS FROM  THE  STAT  FILE 
SET  INTERCLASS  WEIGHTS 
FROM  USER  INPUT  OR 
DEFAULTS 


EXHAUSTIVE  SEARCH 


CSTA'N,  AMD  GET 

a mm$  for, 

ii  mtmt  SET  OF 


■a 


OBTAIN,  AND  GET 
STATISTICS  FOR, 
A UNIQUE  SET  OF 
CaANNELS 


OAVIDON-FLETCHER 

POWELL 


FIRST  GUESS 
AVAILABLE 


mwagjunmng'i 

WmmSnmMml 
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10-1.  — Functional  flow  chart  for  the  SELECT  processor. 
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Figure  10-2.“  Deck  setup  for  the  SELECT  processor. 
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Figure  10-4.—  Sample  program  listing  and  output  for  the 
SELECT "processor  using  all  options. 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.-  Continued. 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.-  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 
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Figure  10-4.—  Continued 


11.  CLASSIFICATION  PROCESSOR  - CLASSIFY 


The  classification  processor  CLASSIFY  classifies  the  MSS  image 
data  based  on  statistics  (mean  vectors  and  covariance  matrices) 
computed  from  the  training  fields. 

11.1  PROCEDURES 

* .in..—*— 

Given  the  statistics  for  each  subclass  of  interest , each  data 
point  within  the  defined  classification  field  from  the  MSS  data 

£ 

v tape  (DATAPE)  is  assigned  to  a subclass  by  one  of  two  procedures 

In  the  first  procedure,  the  user  does  not  define  categories  in 
his  input,  and  the  standard  m-class  maximum  likelihood  classifi- 
cation rule  is  followed.  However,  to  decrease  the  number  of 
times  the  density  function  must  be  computed,  the  classification- 
by-thresholding  procedure  proposed  by  Hallum  and  Minter  (ref.  8) 
and  improved  upon  and  implemented  by  Feiveson  (ref.  9)  is  used. 
The  standard  classification  rule  (i.e.,  when  no  categories  are 
defined  by  the  user)  is  outlined  in  section  11.1.1. 

In  the  second  procedure,  the  user  defines  categories  in  his 
input,  and  the  sum-of -normal-densities  classification  rule  is 
followed,  as  set  out  in  section  11.1.2. 

11.1.1  STANDARD  M-CLASS  CLASSIFICATION 

Assuming  multivariate  normal  probability  density  functions  and 
using  the  maximum  likelihood  classification  rule,  the  data 
vector  X = (X^XjfX^,  • • • ,XN)  is  assigned  to  subclass  i in  the 
following  manner. 

t • 

The  assumed  joint  probability  density  function  of  X,  when 
material  of  type  i fills  the  MSS  field  of  view,  is  given  in 
the  following  equation. 

/jH\ 

U,  i>dr^ 

. . ....  „ . .. 


where: 


= a priori  probability  for  subclass  i 
N = number  of  channels  used  for  classification 
= covariance  matrix  for  subclass  i 
X = data  vector  (xi»x2 ,X3, • • • ,XN) 

Q±  = (X  - yi)TK"1(X  - yi) 

y^  = mean  vector  for  subclass  i 

Because  of  the  exponential  form  of  p^  and  because  An (p^)  is  a 
monotonically  increasing  function  of  p^,  for  computational  pur 
poses  it  is  convenient  to  define  a new  function  by 

vi  = *n(Pi)  = An(p±)  - |An(2Tt)  - ^-Jtn | K± | 

- |(X  - y±) TkT1  (X  - y^  (11 

The  data  vector  X is  classified  as  belonging  to  subclass  i if 
> Vj  for  all  i / j,  where  j = l,2r3,***,n  and  n = number  of 
subclasses. 

The  number  of  times  the  function  must  be  computed  may  be 
reduced  by  the  use  of  thresholds;  i.e. , real  numbers 
(independent  of  X)  such  that 

V.(X)  > y±j  implies  Vi(X)  > V..  (x) 
and  Vj (X)  > implies  Vj (X)  > V^(X) 

where  i,j  = l,2,3,*»*,n  and  i ^ j. 


X,,  ■■  --  T;:- "T- ■■  ■ --  ••»•••••••: ; ■■  T ' 


The  utility  of  these  thresholds  is  that,  if  V^(X)  > y^ . , Vj (X) 
need  not  be  computed.  Once  the  values  for  y^  have  been  deter- 
mined, they  may  be  used  for  each  observation  vector  X. 


11.1.2  SUM-OF-NORMAL-DENSITIES  CLASSIFICATION 

Also,  assuming  multivariate  normal  probability  density  functions, 

T 

the  category  classifier  classifies  the  data  vector  X = (X^Xj* 
X3,*»»,x  ) to  category  j and  subclass  i in  the  following  manner. 


The  probability  density  function  for  each  category  j is  computed 
by  the  following  equation. 


Pj  (X) 


Pi  --0 • 5Qi 

(2TT)N/f5|Ki|1-/2 


(11-4) 


where 

i = subclass  number 
j = category  number 

kj  = number  of  subclasses  in  category  j 

= a priori  probability  for  subclass  i in  category  j 

N = number  of  channels  used  for  classification 

Having  computed  the  probability  density  function  for  all  cate- 
gories, the  data  vector  X is  classified  as  belonging  to  cate- 
gory j if  Pj  > p^,  where  £ = l,2,3,*»*,q  for  all  j ? £ and 
q = number  of  categories. 

The  data  vector  is  classified  as  belonging  to  subclass  i if  the 
probability  density  function  for  subclass  i in  category  j is 
such  that  p..  > p.  for  all  i / m,  where  m = 1 , 2 , 3 , • • • ,k  . . In 
the  computation  of  , if  the  value  of  the  quadratic  form 
is  smaller  than  -88,  the  computer  cannot  store  the  computed 
value  of  e^i.  Thus,  e^i  = 0 for  all  CK  < -88.  In  the  case 


of  all  Pj  = 0 for  j = l,2,3,#,,,q,  the  data  point  will  not  be 
classified;  it  will  be  assigned  to  a null  subclass. 

When  the  line  printer  map  of  the  classified  data  is  displayed, 
each  data  point  is  printed  with  the  symbol  representing  the 
legitimate  subclass  to  which  the  data  point  belongs,  and  the 
null  subclass  is  printed  with  the  blank  symbol.  Figure  11-1 
gives  the  functional  flow  of  the  CLASSIFY  processor. 

11.2  INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input  to  the  CLASSIFY  processor. 
The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may 
assign  any  available  logical  unit.  (See  table  4-1  for  file 
assignments  and  section  3.2,  MSS  Image  Data  Tapes,  for  further 
information  on  format.) 

11.3  OUTPUT  FILES 

The  classification  results  are  output  on  the  MAPTAP  file  (see 
appendix  C)  which  is  assigned  to  logical  unit  B (Fortran  unit  2) . 
In  the  event  of  card  input  of  the  module  STAT  deck,  the  sta- 
tistics will  be  output  on  the  SAVTAP  file  (see  section  4.1). 

11.4  SCRATCH  FILES 

The  processor  requires  no  scratch  file. 

11.5  CARD  INPUT 

All  required  input  card  types  are  described  below. 


11.5.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  beginning  in  column  1; 
the  parameter  FILE  is  punched  starting  in  column  11.  For  example, 

$CLASSIFY  FILE=N 

This  card  directs  the  system  monitor  routine  to  load  all  routines 
used  by  the  CLASSIFY  processor.  The  parameter  FILE=N  informs  the 
processor  to  output  the  current  classification  results  on  file  N 
of  MAPTAP.  If  no  integer  file  number  is  specified,  the  processor 
defaults  to  file  1 of  MAPTAP. 

11.5.2  SPECIAL  SYSTEM  DECKS 

The  training  statistics  may  be  input  by  means  of  the  module  STAT 
deck.  The  B-transformation  matrix  may  be  input  by  means  of  the 
B-matrix  card  deck.  The  EOD-LARSYS  deck  formats  are  described 
in  section  3.1. 

11.5.3  CONTROL  CARDS 

The  cards  described  in  table  11-1  are  the  complete  set  of  control 
cards  recognized  by  the  CLASSIFY  processor.  All  options  available 
to  the  user  are  exercised  by  use  of  the  appropriate  processor  con- 
trol card.  If  a default  condition  is  specified,  the  control  card 
is  optional.  If  no  default  condition  is  specified,  the  control 
card  is  mandatory  input.  Ordering  of  the  sequence  of  processor 
control  cards  is  unnecessary,  with  the  exceptions  that  (1)  the 
♦END*  card  must  follow  the  last  processor  control  card,  (2)  the 
$END*  card  must  follow  the  last  field  definition  card  for  an  area 
to  be  classified,  and  (3)  the  STATFILE  control  card  must  precede 
the  input  of  the  module  STAT  deck  in  some  cases.  See  table  11-1. 


11.5.4  FIELD  DEFINITIONS 

Areas  to  be  classified  are  communicated  to  the  classification 
processor  by  using  the  field  definition  data  card  described  in 
section  3.1.3,  which  contains  the  scan  line  and  sample  coordi- 
nates for  the  area  over  which  classification  is  to  be  performed. 
At  least  one  field  definition  card  must  be  in  the  run  deck  imme- 
diately following  the  *END*  control  card.  As  many  field  defini- 
tion cards  as  there  are  areas  to  be  classified  may  be  input.  The 
processor  will  classify  each  field  in  the  order  it  is  identified, 
will  print  on  the  line  printer  the  first  110  samples  of  the 
classification  map,  and  will  print  any  optional  output  prescribed 
by  the  control  cards  for  each  field  classified.  The  scan  line 
and  sample  coordinates  specified  on  the  field  definition  card 
must  be  available  on  the  input  MSS  data  tape  (DATAPE) . 


11.5,5  DECK  SETUP 

The  deck  setup  for  the  CLASSIFY  processor  is  given  in  figure  11-2. 
The  @ in  column  1 indicates  the  master  punch  for  the  Univac  sys- 
tem, which  is  the  7-8  multipunch. 

11.6  CARD  OUTPUT 

The  classification  processor  outputs  no  punched  cards. 

11.7  SAMPLE  COMPUTER  RUNS 

The  first  sample  test  run  (fig.  11-3)  illustrates  the  manner  in 
which  most  of  the  available  CLASSIFY  options  are  exercised.  The 
SAVTAP  file,  which  has  been  created  by  a previous  execution  of 
the  STAT  processor,  is  read  from  tape.  SAVTAP  contains  descrip- 
tions and  statistics  for  23  fields,  8 classes,  and  9 subclasses. 
The  class-subclass  correspondence  is  as  follows. 


Class 

number 

Class  name 

Subclass 

number 

Subclass  name 

1 

S0YBN1 

1 

SOYBN1 

2 

CORN 

2 

CORN 

3 

OATS 

3 

OATS 

4 

WHEAT 

4 

WHTl 

5 

WHT2 

5 

RDCL1 

6 

RDCL1 

6 

ALFALF 

7 

ALFALF 

7 

RYE 

8 

RYE 

8 

BRSOIL 

9 

BRSOIL 

The  grouping  option  combines  subclasses  4 and  5 (WHT1  and  WHT2 ) 

to  form  a new  subclass  WHEATS, 

, and  the  number  of  subclasses 

used 

in  classification  will  be  eight.  Note  that 

WHEATS  is  a new 

sub- 

class  belonging  to  the  class  WHEAT,  and  the 

class  name  WHEAT  — 

not  the  subclass  name  WHEATS  - 

- goes  on  the  1 

CATEGORY  control 

card 

Figure  11-4  is  a sample  computer  program  listing  using  all 
default  options.  No  output  is  provided. 


11.8  RESTRICTIONS 

The  system-related  restrictions  described  in  section  17,  along 
with  the  following,  apply  to  the  CLASSIFY  processor. 


The  category,  class,  and  subclass  relationship  is  as  follows: 


Number  of  < Number  of  < Number  of  <50 
categories  - classes  - subclasses  - 


(11-5) 


The  size  of  the  B-matrix  cannot  exceed  450  locations: 

/Number  of  linear \ /Number  of  chan-  \ < -50  (11-6) 

l combinations  /vnels  in  B-matrix/  - ' ' 

The  channels  used  in  computing  the  B-matrix  automatically  replace 
the  channels,  if  any,  on  the  CHANNELS  control  card. 


The  largest  sample  number  of  the  classification  field  minus  the 
smallest  sample  number  of  the  classification  field  cannot  exceed 
1000. 


Beginning  with  the  smallest  sample  number  of  the  classification 
field,  only  the  next  110  samples  are  displayed  on  the  line-printer 
map  output  by  CLASSIFY,  but  the  entire  classified  field  is  dis- 
played on  the  line-printer  map  output  by  DISPLAY. 

When  applying  the  category  classifier  option,  12  500  storage 
locations  are  reserved  for  the  data  such  that 

e end  - sample  start\ /Number  of\  „ , 0 ___  ... 

F+  1 — M channels  ) ~ 12  500  (11"7) 


When  applying  the  standard  classifier  option,  the  table  computed 
for  the  class-pair  thresholding  procedure  shares  this  storage  of 
12  500  locations  reserved  for  the  data  such  that 


.( 


Number  of 
subclasses 


/Number  of 
\subclasses 
2 


2) 


+ 


Number  of 
subclasses 


/Sample  end  - sample  start\Number  of  < -2  ,-qq 
\ 5 + 1 /channels  - 


(11-8) 


11.9  DIAGNOSTIC  MESSAGES 

The  diagnostic  messages  and  the  routines  in  which  they  appear  are 
as  follows . 


11.9.1  CLSFY1  ROUTINE 
Message 

*****  CLSFY/CLSFYl/CLSFlA  

THE  COVARIANCE  MATRIX  FOR  SUB- 
CLASS NO.  XX  IS  EITHER  SINGULAR 
OR  NOT  POSITIVE  DEFINITE  - THE 
DETERMINANT  = XXXX.XXXX 


Explanation 

The  determinant  of  each  sub- 
class covariance  matrix  is 
checked  by  CLASSIFY  to  see  that 
it  is  a positive  nonzero  value. 

A zero  value  indicates  a singular 


Explanation 


TERMINATING  PROGRAM 


EXECUTION  ***** 


matrix,  and  a negative  value  i 
indicates  a nonpositive  defi- 
nite matrix.  If  either  con- 
dition occurs  for  any  subclass 
covariance  matrix  to  be  used 
in  classification,  the  proces- 
sor will  stop.* 


11.9.2  CLSFY2  ROUTINE 


Message 


WIDTH  OF  RECTANGULAR  FIELD  SUR- 
ROUNDING CLASSIFICATION  FIELD 
CANNOT  EXCEED  1000  POINTS. 


TOO  MUCH  DATA  REQUESTED. 


Explanation 

The  largest  sample  of  the  clas- 
sification field  minus  the 
smallest  sample  of  the  classi- 
fication field  cannot  exceed 
1000  samples.  Reduce  amount 
of  samples  per  scan  line. 

When  too  much  data  has  been 
requested,  (1)  for  the  stand- 
ard classifier,  reduce 
parameters  so  that 

/Number  of  _ /Number  of  _ 2) 
\ subclasses  /\ subclasses  I 


+ Number  of  /Points  per\ 

subclasses  \scan  line  / 

* (“if)  S « or 

(2)  for  category  classifier, 
reduce  data  so  that  points  per 


'A  probable  source  of  an  invalid  covariance  matrix  is  a module 
STAT  deck  which  has  been  incorrectly  formatted  and  thus  is  not 
producing  good  training  class  statistics.  Another  possible 
source  is  that  the  SAVTAP  file  does  not  contain  valid  statis- 
tical data. 


■ 


Message 


11.9.3  REDIF2  ROUTINE 


*****  CLSFY/REDIF2  BAD 

CARD  INPUT  DETECTED  ON  ATTEMPT 
TO  READ  B-MATRIX  INFORMATION 
AS  DIRECTED  BY  THE  CONTROL 
CARD  • • • 

' CCCC  • • • CCCC ' 

*****  TERMINATING  PROGRAM 
EXECUTION  FROM  REDIF2  ***** 

**  CLSFY/REDIF2  ~ B-MATRIX 
INPUT  FROM  BMFILE  - BAD  INPUT 
VALUES  DETECTED:  NO.  COMBINA- 
TIONS ( BMCOMB ) = , NO. 

CHANNELS  (BMFEAT)  = , 

CHANNEL  VECTOR  (FETVC2)  = . 

AT  LEAST  TWO  (2)  CATEGORIES 
MUST  BE  ASSIGNED.  EXITING  FROM 
REDIF2 . 

11.9.4  SETUP2  ROUTINE 
Message 

*****  CLSFY/SETUP2  •••  ERROR 
CONDITION  ON  ATTEMPT  TO  POSI- 
TION MAPTAP  TO  FILE  NO.  XX 

*****  ERROR  STATUS  CODE  = YY 
ABORTING  THE  RUN  ***** 


Explanation 

scan  line  * number  of  chan- 
nels < 12  500. 


Explanation 

The  input  B-MATRIX  control  card 
is  printed  out  as  part  of  the 
error  message.  One  of  the  data 
cards  following  it  is  incor- 
rectly formatted.  Check  deck 
setup  and  B-matrix  card  deck. 


Invalid  data  from  the  BMFILE 
has  been  deleted. 


In  exercising  the  category 
option,  two  or  more  categories 
must  be  used. 


Explanation 

The  CLASSIFY  processor  attempted 
to  position  the  output  classi- 
fication results  file  (MAPTAP) 
to  the  file  number  specified 
on  the  $CLASSIFY  processor  card. 
Possibly  (1)  more  files  were 


JJKfO 


i 

x 


indicated  than  currently 
existed  on  the  MAPTAP  file, 

(2)  bad  tape  if  the  file  is 
assigned  to  tape,  or  (3)  the 
format  of  the  $CLASSIFY  proc- 
essor card  is  incorrect. 


AN  ERROR  HAS  OCCURRED  IN  GROUP- 
ING CLASSES  INTO  CATEGORIES. 
CHECK  THE  FOLLOWING: 

NOT  ALL  OF  THE  CLASSES  HAVE 
BEEN  ASSIGNED  TO  A CATEGORY. 

A CLASS  NAME  ON  THE  CATEGORY 
CARD  HAS  BEEN  MISSPELLED. 

CLASS  NAMES  FROM  SAVTAP  FILE 

ARE : . CLASS 

NAMES  FROM  CATEGORY  CARDS 
ARE: 


When  an  error  occurs  in  group- 
ing classes  into  categories, 
either  one  or  more  class  names 

(1)  have  not  been  assigned  or 

(2)  have  been  misspelled.  The 
program  lists  the  class  names 
as  submitted  from  the  SAVTAP 
file  or  cards.  Check  these  for 
errors.  If  neither  (1)  nor 

(2)  is  applicable,  check  the 
module  STAT  deck  to  assure  that 
class  names  are  left  justified 
in  the  field. 


ay? 


TABLE  11-1.-  CLASSIFY  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Parameter  and 

Keyword  default  values 

(a)  (b) 

SUBCLASS  Kx , K2 , • • • , Ki 
l<Ki<60; 

i = number  of  sub- 
classes in  training 
statistics 
Default:  All  the 

training  subclasses 
are  used. 


Function 

K's  are  integers  comprising  the 
set  of  subclass  numbers  used  by 
the  processor  to  clas jify  the 
unknown  data  points;  must  be  a 
subset  of  training  subclasses 
designated  as  they  occur  on  the 
SAVTAP  file. 


CHANNELS 


STAT=NlfN2,‘“rNk, 

DATA=M1,M2,‘*‘,Mk 

k<30 

Default:  (1)  If  exe- 

cuted back  to  back 
with  SELECT,  the  chan- 
nels selected  by  the 
SELECT  processor  are 
used;  (2)  If  a 
B-matrix  is  input,  the 
channels  used  in  com- 
puting the  matrix  are 
used;  (3)  Otherwise, 
all  channels  in  the 
training  statistics 
are  used. 


N1'N2'***'NK  are  t*ie  c^annel 
numbers  (integers)  from  the 

SAVTAP  file  to  be  used  in 
classification;  ,M2 , • • • ,MR 
are  the  channel  numbers 
(integers)  from  the  MSS  data 


tape  (DATAPE) . The  number  of 
channels  selected  from  SAVTAP 


and  DATAPE  must  be  equal . 


CATEGORY  CATNAM/NAME  x , NAME  2 , • • • 
Default:  If  no 


Informs  the  processor  that  the 
category  classifier  option 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

^The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 


TABLE  11-1.—  Continued. 


Keyword 


GROUP 


Parameter  and 

default  values  Function 


categories  are  de- 
fined, the  standard 
classifier  is 
applied . 


will  be  applied  and  defines 
one  category  name  (CATNAM)  and 
the  class  names  (NAME^'s)  for 
this  category.  All  subclasses 
for  a class  are  assigned  to 
this  category.  CATNAM  and 
NAME^  may  be  up  to  six  charac- 
ters, and  NAME^  must  match  a 
class  name  on  the  SAVTAP  file. 

A slash  (/)  separates  the  cate- 
gory name  from  the  class  name. 

Note : (1)  Every  class  must  be 

assigned  to  a category  unless 
the  class  was  eliminated  by 
omitting  all  of  its  subclasses 
on  the  SUBCLASS  control  card; 

(2)  At  least  two  categories 
must  be  defined;  (3)  Continu- 
ation of  the  list  of  class 
names  in  the  category  on 
another  card  is  indicated  by 
an  asterisk  after  the  last 
class  name  of  that  card.  The 
next  card  should  continue  the 
list  of  class  names  in  col- 
umns 11-72.  (See  test  example 
in  section  11.6.) 


SUBNAM , , K2 , • • • , Ki ' s are  integer  subclass  num- 
1<K^<60;  i=number  of  bers  taken  from  the  set  of 
training  subclasses  available  training  subclasses. 


TABLE  11-1.—  Continued. 


Parameter  and 

Keyword  default  values  Function 

Default:  Subject  to  The  processor  creates  a new 

the  SUBCLASS  control  training  subclass  by  combining 

card,  each  individual  the  statistics  of  the  training 

training  subclass  is  subclasses  listed.  The  training 

used  as  a possibility  subclasses  used  are  not  there- 
for unknown  data  after  available  as  individual 

sample  classification  subclass  possibilities  for  an 

unknown  data  sample.  The  set 
of  training  subclasses  to  be 
used  is  renumbered  by  the  proc- 
essor to  account  for  the  new 
grouped  subclass  and  the  train- 
ing subclasses  deleted  by 
grouping.  The  revised  set  of 
training  subclasses  is  used 
for  all  processor  output. 

SUBNAM  may  be  from  one  to  six 
characters  and  will  become  the 
name  for  a new  training  subclass. 


DATAFILE  UNIT=N , FILE=M  N is  the  Fortran  logical  unit 

Default:  N=3,M=1  number  to  which  the  MSS  data 

tape  (DATAPE)  has  been  assigned; 
M is  the  file  number  of  the  tape 
to  be  processed.  For  back-to- 
back  executions  of  several 
processors,  if  usirg  the  same 
file  number,  only  one  DATAFILE 
control  card  need  be  input. 


t 


352* 


Keyword 

STATFILE 


MODULE 


B-MATRIX 


TABLE  11-1.—  Continued. 


Parameter  and 

default  values  Function 


UNIT=N,FILE=M  N is  the  Fortran  logical  unit 

Default:  N=1 ,M=1  number  to  which  the  SAVTAP  file 

has  been  assigned;  M is: 

(1)  the  number  of  the  file  to 
be  processed  or  (2)  , if  the 
module  STAT  deck  is  input,  the 
number  of  the  file  for  storing 
the  statistics.  If  M/l,  this 
control  card  must  precede  the 
module  STAT  deck  in  the  control 
card  deck  setup. 


Blank 

Default:  Training 

subclass  statistics 
are  read  from  the 
input  file  SAVTAP. 


Indicates  to  the  processor  that 
the  training  subclass  statis- 
tics will  be  input  on  cards. 

The  module  STAT  deck  must  imme- 
diately follow  this  control 
card.  See  section  3. 1.4.1  for 
further  details. 


CARDS  or  FILE 
Default:  No  transfor- 

mation of  training 
subclass  covariance 
matrices 


Informs  the  processor  that  the 
B-transformation  matrix  is  to 
be  input  and  applied  to  the 
training  subclass  statistics 
prior  to  classification.  If 
FILE  is  placed  in  the  parameter 
field,  the  mode  of  B-matrix 
input  will  be  from  BMFILE ; if 
CARDS  is  specified,  the 
B-matrix  card  deck  must  imme- 
diately follow  this  control 


TABLE  11-1.—  Continued. 


Keyword 


APRIORI 


Parameter  and 

default  values  Function 

card.  The  channels  which 
were  used  to  derive  the  B- 
transformation  matrix  will  be 
the  channels  used  by  the 
processor  in  classification. 
(See  section  3. 1.4. 2 for 
further  details.) 


a1*a2',,,,AM 


or 

n*a.,k*an+1,---,ah 

M<60 

Default:  If  executing 

the  standard  classi- 
fier, each  subclass  is 
given  an  equal 
a priori  value.  If 
executing  the  category 
classifier,  each  cate- 
gory is  given  an  equal 
a priori  value  which 
is  divided  equally 
among  the  subclasses 
in  that  category. 


A priori  values  may  be  input  by 
subclass,  class,  or  category. 

N and  K are  arbitrary  repeti- 
tion factors,  and  A^'s  are 
decimal  numbers  such  that 
M 

£ Ai  - 
i=l 

M = number  of  training  sub- 
classes, training  classes,  or 
categories.  If  input  by  class 
or  category,  the  setup  routine 
will  distribute  the  a priori 
values  among  the  subclasses  in 
the  following  manner: 

_ , _ Class  a Priori  values 

* c Number  of  subclasses 

in  that  class 

Category  a priori 

_ . values 

By  category  = Number  of  sub. 

classes  in  that 
category 

The  order  in  which  the  A^'s  are 
input  must  be  the  order  in  which 


11 


'k  , 


* 


1 

n 


TABLE  11-1 

Parameter  and 

Keyword  default  values 


OPTION  STATS 

Default:  No  training 

subclass  statistics 
printout 


HED1  Any  60  characters 

beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 


Concluded. 

Function 

the  category , class , or  subclass 
was  defined. 

Training  statistics  will  be 
printed  out  for  each  subclass , 
reflecting  the  B-transformation, 
if  any,  and  the  Cholesky  fac- 
torization of  the  covariance 
matrices. 

Replaces  first  header  line  with 
the  indicated  characters  in  the 
parameter  field. 


HED2 

Any  60  characters 

beginning 

in  column 

11 

Default: 

HOUSTON, 

TEXAS 

DATE 

Any  12  characters 

beginning 

in  column 

11 

Default: 

Current  date 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Prints  the  indicated  12  charac- 
ters in  the  right  corner  of  the 
heading  in  place  of  the  current 
date. 


COMMENT  Any  60  characters 

beginning  in  column  11 
Default:  No  comments 

printed 

*END*  Blank 


Prints  a comment  line  using  the 
60  characters  found  in  the 
parameter  field. 

Signals  the  end  of  the  control 
cards. 


SEND*  Blank 


Signals  the  end  of  card  input 
for  the  processing  function. 


A*'- 


Figure  11-1.—  Concluded 


* .* 


Results  to  be  output]/ 

on  file  1 Of  MAPTAP  (J$CLASSIFY  FILE=1 


(Front  o!  dock) 


(a)  For  independent  execution  with  all  possible  files  assigned. 
Figure  11-2.—  Deck  setup  for  the  CLASSIFY  processor. 
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Figure  11-3.—  Sample  program  listing  and  output  for  the  CLASSIFY  processor  using 
all  options  except  B-MATRIX  and  CHANNELS  default. 
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Figure  11-3.—  Continued. 
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Figure  11-3.—  Continued 
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Figure  11-3.—  Continued 
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Figure  11-3.-  Continued. 


,M,0'|  «»f«T  •’  C»t««tU 

ct«»ir«t*TiON  stuot  •••  »*M*f  Mtl 


Figure  11-3.—  Continued. 
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Figure  11-3.—  Continued. 
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Figure  11-3.-  Continued. 
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Figure  11-3.—  Continued. 
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Figure  11-3.—  Continued 
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Figure  11-3.—  Continued 
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Figure  11-3 .—  Concluded . 
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Figure  11-4.—  Sample  program  listing  for  the  CLASSIFY  processor  using 

all  default  options. 


12.  PERFORMANCE  DISPLAY  PROCESSOR  - DISPLAY 


The  DISPLAY  processor  reads  the  MAPTAP  tape  output  by  CLASSIFY 

and  performs  the  following  functions. 

a.  Provides  a line  printer  map  of  each  classified  field  on 
MAPTAP.  The  training  and  test  fields  within  the  classified 
image  are  outlined. 

b.  Produces  classification  summaries  for  each  classified  field, 
which  gives  a breakdown  on  the  number  of  pixels  classified 
into,  and  the  number  of  pixels  thresholded  from,  each  sub- 
class, class,  and  category. 

c.  Produces  (optionally)  an  intensive  test  site  (ITS)  classi- 
fication summary  for  one  crop  type  versus  all  other  crop 
types;  the  user-specified  crop  may  be  a category,  class,  or 
subclass . 

d.  Allows  the  user  to  designate  fields  to  be  excluded  from  the 
classification  summaries.  Fields  may  be  designated  "uniden- 
tifiable" or  "other."  Pixels  within  the  unidentifiable 
fields  are  counted  and  are  not  considered  in  the  classifica- 
tion summaries.  Pixels  within  the  designated  "other"  fields 
are  counted  as  a separate  crop  type  regardless  of  how  they 
were  classified.  These  pixels  are  included  in  category 
"other"  in  the  ITS  report.  (See  section  12.4.4  for  sample 
input  of  designated  fields.) 

All  pixels  within  the  designated  areas  are  printed  with  the 
pound  (#)  symbol. 

e.  Assigns  a pixel  to  the  threshold  class  if  thresholding  is 
requested  and  if  CL  > t^,  where 

T -1 

CL  = the  value  of  the  quadratic  form  (X  - y^)  (X  - y^) 

as  computed  by  CLASSIFY  (section  11.1.1)  for  subclass  i 

y^  = mean  vector  for  subclass  i 


1 2^r 


= covariance  matrix  for  subclass  i 

t.  — threshold  value  for  subclass  i 
x 

Allows  t^  to  be  determined  in  one  of  four  ways : 

• User  input  — The  user  inputs  the  exact  threshold  value. 

See  control  cards  THRESHOLD  and  OPTION  THRESHOLD  VALUE. 

• Chi-square  option  — The  user  inputs  confidence  levels  for 
each  subclass  on  the  THRESHOLD  card  and  includes  the 
OPTION  CHI  SQUARE.  The  program  obtains  the  chi-square 
threshold  value  from  an  internal  chi-square  functional 
routine . 

• Empirical  option  — The  user  inputs  confidence  levels  for 
each  subclass  on  the  THRESHOLD  card  and  includes  the  OPTION 
EMPIRICAL  card.  The  program  determines  the  empirical  dis- 
tribution function  for  each  subclass  from  the  cumulative 
histogram  of  Q.  for  correctly  classified  pixels  in  the 
ground  truth  areas  (i.e.,  training  or  test  fields),  as 
shown  in  the  following  example. 

100 
95 

Confidence 
level 


0 

From  this  curve,  the  user  input  of  a 95-percent  confidence 
level  for  subclass  i would  result  in  a threshold  value  of 
6.0. 

See  reference  10  for  more  information  on  the  use  of 
empirically  computed  thresholds. 

• Fisher  F-distribution  option  — The  user  inputs  confidence 
levels  for  each  subclass  on  the  THRESHOLD  card  and  includes 


1 


the  OPTION  FISHER  card.  The  program  obtains  the 
F-distribution  threshold  values  from  an  internal  routine. 
If  a computational  overflow  occurs  in  the  routine,  the 
threshold  value  for  that  subclass  is  set  equal  to  999.999. 

g.  Produces  plots  of  the  empirical  distribution  function  when 
OPTION  PLOT  is  exercised. 

h.  Performs  (optionally)  a four-nearest-neighbors  spatial  fil- 
tering on  the  classified  image.  This  algorithm  takes  into 
consideration  that,  in  many  instances,  a pixel  is  most  likely 
to  be  like  its  nearest  neighbors.  When  the  option  is  exer- 
cised via  the  OPTION  FILTER  control  card,  the  four  nearest 
neighbors  of  each  pixel  are  examined.  If  all  the  neighbors 
are  classified  the  same  and  the  pixel  in  question  is  classi- 
fied differently,  then  it  is  assumed  that  the  pixel  was 
classified  incorrectly  and  its  classification  is  changed. 

In  the  following  example,  the  pixel  classified  as  X will  be 
changed  to  C.  (See  reference  11  for  more  information  on 
this  algorithm. ) 

Line  1 C 

2 C X C 

3 C 

i.  Outputs  (optionally)  the  classified  image  onto  tape  (MAPFIL) 
in  either  LARSYS  II  or  Universal  format  via  the  FORMAT  NAME 
control  card . 

j.  Provides  classification  performance  summaries  for  ground 
truth  areas  within  the  classified  image.  The  following  six 
performance  summaries  are  available  to  the  user.  The  fields 
in  these  reports  are  either  training  fields  used  in  the  STAT 
or  ISOCLS  processor  and  transmitted  to  DISPLAY  via  the  MAPTAP 
file  or  test  fields  input  directly  to  DISPLAY  (see  sec- 
tion 12.4.4). 

• Field  by  subclass 
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• Field  by  class 

• Field  by  category 

• Class  by  subclass 

• Class  by  class 

• Class  by  category 

Figure  12-1  shows  the  functional  flow  of  the  DISPLAY  processor. 
12.1  INPUT  FILES 

The  only  input  file  required  for  DISPLAY  is  the  MAPTAP  (sec- 
tion 4.4)  file  output  by  CLASSIFY.  This  file  must  be  assigned 
to  logical  unit  B (Fortran  unit  2) . If  DISPLAY  is  executed 

I ' 

back  to  back  with  CLASSIFY,  the  file  may  be  assigned  to  Fastrand. 
However,  if  DISPLAY  is  to  be  executed  later,  the  file  must  be 

i assigned  to  tape  and  saved  when  CLASSIFY  is  executed. 

)■ 

I 

K 12.2  OUTPUT  FILES 

I 

t The  DISPLAY  processor  will  optionally  generate  a MAPFIL  tape  of 

t 

the  classified  image  for  display  on  the  DAS.  The  control  card 
FORMAT  allows  the  user  to  exercise  this  option.  When  requested, 

* the  tape  should  be  assigned  to  a nine-track  tape  drive  for  com- 

patibility with  the  DAS  tape  drives.  The  tape  assignment  must  be 
made  to  logical  unit  N (Fortran  unit  16) . 

!'  12.3  SCRATCH  FILES 

t 

i The  random  access  drum  file  is  used  as  a scratch  file  in  DISPLAY. 

\ No  assignment  is  necessary. 


12 . 4 CARD  INPUT 

All  system  formats  referred  to  in  this  section  are  defined  in 
section  3 . 


12.4.1  PROCESSOR  CARD 


The  processor  keyword  is  left  justified  starting  in  column  1; 
the  parameter  FILE  is  punched  starting  in  column  11.  For 
example , 

$DISPLAY  FILE=N 

This  card  directs  the  monitor  routine  to  select  the  DISPLAY 
processor  and  initiates  the  loading  of  routines  used  by  DISPLAY. 
Parameter  value  N is  the  file  number  on  the  MAPTAP  file  to  be 
processed;  if  not  input,  default  is  to  file  1 of  MAPTAP. 

12.4.2  SPECIAL  SYSTEM  DECKS 

No  special  decks  are  required  for  the  DISPLAY  processor. 

12.4.3  CONTROL  CARDS 

Table  12-1  lists  the  control  cards  and  available  options  for 
the  DISPLAY  processor. 

12.4.4  CLASS,  SUBCLASS,  AND  FIELD  DEFINITIONS 

Both  test  and  designated  fields  are  optional  input  to  DISPLAY. 
However,  both  types  of  fields  cannot  be  input  in  the  same  exe- 
cution of  DISPLAY.  If  no  test  fields  are  input,  the  ground- 
truth  summaries  will  be  for  training  fields.  When  input,  test 
fields  must  be  identified  with  a previously  defined  class  or 
subclass.  All  test  class,  subclass,  and  field  definitions  begin 
immediately  following  the  *END*  control  card  and  are  terminated 
by  the  $END*  control  card.  Formats  for  the  CLASSNAME,  SUBCLASS, 
and  field  definition  cards  are  defined  in  section  3.1.3.  The 
following  example  shows  test  field  input  to  DISPLAY.  Note  that 
test  fields  are  identified  with  classes;  that  is,  each  NAME1, 
NAME 2 , NAME3,  etc.,  must  match  the  name  of  a class  defined  in 
either  STAT  or  ISOCLS  and  used  in  CLASSIFY. 


$DISPLAY 
(Control  cards) 
*END* 

CLASSNAME 
FIELD1 
FIELD2 
CLASSNAME 
FIELDS 
FIELD4 
FIELDS 
CLASSNAME 
FIELD6 

Send* 


NAMEl 


NAME  2 


NAME  3 


In  the  following  example,  test  fields  are  identified  with  sub- 
classes, in  which  case  each  NAMEl,  NAME2,  NAME3,  etc.,  must 
match  the  name  of  a subclass  used  in  CLASSIFY. 


SDISPLAY 
(Control  cards) 

*END* 

SUBCLASS  NAMEl 

TEST  FIELDS  FOR  SUBCLASS  NAMEl 

SUBCLASS  NAME2 

TEST  FIELDS  FOR  SUBCLASS  NAME 2 

$END* 


Designated  fields  are  large  areas  within  the  classified  area 
which  are  either  unidentifiable  or  can  be  specifically  identi- 
fied as  being  other  than  the  crop  type  of  interest.  This  type 
of  field  input  is  meaningful  only  when  the  ITS  summary  report 
is  being  generated  for  one  specific  crop  type.  Pixels  within 
unidentifiable  areas  are  removed  from  the  summaries  altogether. 
Pixels  within  the  designated  ''other'1  areas  are  counted  as  other 
regardless  of  how  they  were  classified. 


An  example  of  input  designated  fields  follows. 
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$DISPLAY 
(Control  cards) 

♦END* 

DESIGNATE  UNIDENTIFIABLE 
(Field  definitions) 

DESIGNATE  OTHER 
(Field  definitions) 

$END* 

Either  one,  both,  or  neither  of  the  two  types  of  designated 
fields  may  be  input. 

12.4.5  DECK  SETUP 

The  deck  setup  for  the  DISPLAY  processor  is  given  in  figure  12-2. 
The  @ in  column  1 indicates  the  master  punch  for  the  Univac 
system,  which  is  the  7-8  multipunch. 

12.5  CARD  OUTPUT 

No  cards  are  output  by  the  DISPLAY  processor. 

12.6  SAMPLE  COMPUTER  RUNS 

Sample  computer  program  listings  are  given  in  figures  12-2,  12-3, 
and  12-4. 

Figure  12-3  illustrates  the  use  of  the  F-distribution  threshold- 
ing OPTION  FISHER.  A map  of  the  classified  data  on  tape  X15620 
is  printed  with  training  fields  outlined,  a classification  sum- 
mary for  each  training  field  classified,  and  statistics  for  each 
classified  subclass. 

Figure  12-4  is  a program  listing  of  DISPLAY  using  all  defaults. 
It  will  print  a map  of  classified  data  on  tape  V00833,  a classi- 
fication summary  for  each  field  classified,  and  a performance 
summary  by  training  classes.  No  thresholding  is  performed. 


a.?o 


Figure  12-5  demonstrates  input  for  generating  the  ITS  summary 
report  and  for  inputting  field  designations.  The  fields  were 
selected  to  demonstrate  the  input  rather  than  to  show  that  they 
were  unidentifiable  or  "other."  The  crop  type  CORN  was  defined 
as  a class  in  STAT  and  assigned  to  the  category  NONWH  in 
CLASSIFY.  Output  is  provided  for  this  run. 

See  appendix  H for  additional  output  by  the  DISPLAY  processor. 


12.7  RESTRICTIONS 

The  system-related  restrictions  given  in  section  5 apply  to 
this  processor. 


12 . 8 DIAGNOSTIC  MESSAGES 

See  section  6 for  further  diagnostic  messages. 

Message  Explanation 


*****DSPLAY/SETUP3  •••  ERROR 
CONDITION  ON  ATTEMPT  TO  POSI- 
TION MAPTAP  OVER  FILES. 

*****FSBSFL  STATUS  CODE  = 

ABORTING  RUN  *** 


***DSPLAY/SETUP3  CORE 

OVERFLOW  EXECUTION 

TERMINATED*** 


The  system  routine  for  posi- 
tioning files  (FSBSFL)  has 
encountered  difficulties  in 
positioning  MAPTAP  to  the  cor- 
rect file.  Error  occurred  in 
SETUPS  routine  for  DISPLAY. 

User  should  make  sure  that  the 
correct  file  number  for  the 
MAPTAP  has  been  indicated  and 
that  MAPTAP  does  in  fact  have 
the  correct  number  of  files. 

Subroutine  SETUP3  has  computed 
the  storage  needed  for  the 
specific  problem;  if  more  is 
needed  than  is  available, 
this  diagnostic  is  printed. 


Message 

AA0650  INVALID  SUPERVISOR 
CONTROL  CARD. 

WRITE  ON  UNIT  N TERMINATED 
ABNORMALLY.  MAPFIL  TAPE  NOT 
CREATED.  ISTAT  = 


END  OF  TAPE  ENCOUNTERED  ON 
MAPFIL  UNIT.  LAST  LINE  = N. 


*****FISHER  THRESHOLD 
REQUESTED  - NOT  PERFORMED 
•••NO.  SAMPLES  FOR  SUBCLASS 
NAME  (=N)  IS  LESS  THAN  OR  EQUAL 
TO  NUMBER  OF  CHANNELS  (=M) 

FDIST-OVERFLOW  CONDITION  IN 
FISHIN  ROUTINE  FOR 
SUBCLASS=XXXX ♦ THRESHOLD 
SET  TO  999.999 


Explanation 

The  invalid  card  is  printed 
along  with  this  message.  Check 
spelling  of  the  keyword. 

Attempt  to  write  on  MAPFIL 
output  tape  failed.  This  usu- 
ally indicates  a bad  tape. 

ISTAT  is  the  status  code 
returned  from  the  system  binary 
input/output  routine  NTRAN . 
Execution  continues. 

The  end-of-tape  marker  was 
encountered  on  the  MAPFIL  out- 
put tape.  The  last  line  written 
was  N.  Execution  of  DISPLAY 
continues  without  further 
attempts  to  write  on  the  MAPFIL 
tape. 

The  program  compares  the  number 
of  samples  to  the  number  of 
channels.  If  the  number  of 
samples  < number  of  channels, 
the  threshold  request  is 
bypassed. 

The  FISHIN  system  subroutine 
has  returned  an  overflow  con- 
dition. The  threshold  value 
is  set  to  999.999  by  the 
program . 


TABLE  12-1.-  DISPLAY  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


Parameter  and 
default  values 
(b) 


OPTION  PLOT 


OPTION  CHI  SQUARE 

Default: C 


OPTION  FISHER 

Default: C 


OPTION  EMPIRICAL 

Default :C 


Function 

Plots  the  empirical  distri- 
bution functions  obtained 
from  the  cumulative  histo- 
grams of  Q^  for  each  subclass. 

Computes  thresholds  from  the 
chi-square  distribution  using 
the  confidence  levels  input 
on  the  THRESHOLD  control 
card. 

Computes  thresholds  from  the 
Fisher  F-distribution  using 
the  confidence  levels  input 
on  the  THRESHOLD  control 
card. 

Computes  the  empirical 
threshold  values  using  the 
percentages  input  on  the 
THRESHOLD  control  card. 

Uses  the  numbers  input  on  the 
THRESHOLD  control  card  for 
the  actual  threshold  value. 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

^The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 

CIf  the  THRESHOLD  control  card  is  input,  one  of  the  four  options  (CHI  SQUARE, 
FISHER,  EMPIRICAL,  or  THRESHOLD  VALUE)  should  be  input  also.  If  the  OPTION 
card  is  omitted  and  the  THRESHOLD  card  is  input,  chi  square  is  assumed.  If 
more  than  one  THRESHOLD  option  is  input,  only  the  last  one  read  will  be 
performed. 
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TABLE  12-1.—  Continued 


Keyword 

SYMBOLS 


OPTION 


OPTION 


OPTION 


OPTION 


OPTION 


Parameter  and 

default  values  Function 


Sl#S2'”*Sk  Assigns  symbols  si>S2»***  to 

k=number  of  sub-  subclasses  1,2, 

classes  on  MAPTAP  respectively. 

Default:  1,2, •••9 

A , B ,C , D , • • • ,Z, 

1 1 2 , 3 , 4 


STAT 

Default: 

printed 


Prints  statistics  for  sub- 
No  statistics  classes  used  in  the  previous 
CLASSIFY  run.  These  statis- 
tics are  saved  on  the  MAPTAP. 


PCT 

Default:  Performance 

summary  printed  for 
classes  only 


OUTLINE 

Default:  Training 

fields  are  not 
outlined. 

NOMAP 

Default:  Map  printed 


Prints  a performance  summary 
on  a per-field  as  well  as  a 
per-class  basis  for  ground- 
truth  fields  (i.e. , training 
or  test  fields  within  the 
classified  image) . 

Outlines  training  fields 
with  asterisks;  has  no 
effect  on  test  fields.  (Test 
fields  are  always  outlined 
with  "+"  symbol.) 

Instructs  the  processor  not 
to  print  a map  of  the  data; 
only  a performance  summary 
is  printed. 


FILTER 

Default:  Spatial  fil- 

tering is  not 
performed. 


Performs  four-nearest-neighbors 
spatial  filtering  on  the  clas- 
sified image. 


TABLE  12-1.-  Continued 


Keyword 

THRESHOLD 


Parameter  and 
default  values 

VV'Tk 
k=number  of  sub- 
classes on  MAPTAP 
Default:  No 

thresholding 


Function 

Uses  the  threshold  values 
t^,t-» for  subclasses  1, 
2,**»,  respectively;  thresh- 
olds must  be  positive 
floating-point  numbers.  One 
value  must  be  specified  for 
each  subclass  on  the  MAPTAP 
file.  Thresholds  may  be 
specified  also  in  the  fol- 
lowing format: 

Nl*tl,N2*t2' * * ' 

where  N^  and  N2  are  integers 
which  specify  how  many  con- 
secutive times  the  correspond 
ing  thresholds  should  be  used 

For  the  CHI  SQUARE  and  the 
EMPIRICAL  options,  the  num- 
bers input  on  these  cards 
are  the  confidence  levels 
(i.e. , t^=0.99  means  that 
the  user  wants  to  maintain 
99%  or  reject  1%) . 

The  numbers  input  on  the 
OPTION  THRESHOLD  VALUE  card 
are  the  actual  values  to  be 
used  for  thresholding  (i.e. , 
t^=10.02  means  that  the 
threshold  value  for  sub- 
class 1 is  10.02). 


TABLE  12-1*—  Continued 


Keyword 

CROP 


ACREAGE 


SITE 

ANALYST 

PROCEDURE 


Parameter  and 
default  values 

NAME 

Default ; No  ITS 
report 


TOTAL=X, 

CROP=Y, 

OTHER=Z 


Any  24  characters 
Default:  Blanks 


Any  18  characters 
Default:  Blanks 

Any  60  characters 
Default:  Blanks 


Function 


Initiates  the  option  for 
printing  the  ITS  summary 
report  for  the  crop  indi- 
cated. NAME  must  match  a 
category,  class,  or  subclass 
name  used  in  CLASSIFY. 

The  total  acreage  in  the  ITS 
is  X;  acreage  of  the  crop 
named  on  the  CROP  control 
card  is  Y;  and  the  acreage 
of  all  other  crop  types  in 
the  ITS  is  Z.  X,  Y,  and  Z 
are  floating-point  numbers. 
This  input  is  meaningful 
only  if  the  CROP  control  card 
is  input. 

Name  of  the  ITS;  used  in 
printing  the  heading  for  the 
ITS  summary  report. 

Name  of  the  data  analyst 
printed  in  the  heading  for 
the  ITS  summary  report. 

Procedure  used  in  classifica- 
tion of  the  ITS ; printed  in 
the  heading  for  the  ITS 
summary  report. 


Keyword 

FORMAT 

HED1 

HED2 

DATE 

COMMENT 

♦END* 

$END* 


TABLE  12-1.—  Concluded. 


** 


Parameter  and 
default  values 

NAME 

Default:  No  output 

classification  map  tape 
is  generated  by 
DISPLAY. 


Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON , 

TEXAS 

Any  12  characters 
beginning  in  column  11 
Default:  Current  date 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 

Blank 

Blank 


Function 

If  NAME=UNIVERSAL,  the  out- 
put classification  tape 
(MAPFIL)  will  be  generated 
in  the  Universal  format.  If 
NAME=LARSYS , the  MAPFIL  tape 
will  be  generated  in  the 
LARSYS  II  format. 

Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Prints  the  indicated  12  char- 
acters in  the  right  corner  of 
the  heading  in  place  of  the 
current  date. 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 

Signals  the  end  of  the  con- 
trol cards. 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 
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Figure  12-1.—  Concluded. 
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Figure  12-2.—  Deck  setup  for  the  DISPLAY  processor 
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Figure  12-2 Continued. 
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Figure  12-2.—  Continued. 


30 


- .....  AlJtJun 


(Bock  of  dock) 


Repeated  for 
all  test  classes  ^ 
or  subclasses  of 
interest. 


6 


6 


6 


@ EB  PMD 


$EXIT 


$END* 


( 


( 


< 


{ 


( 


(■ 


a 


(Variable  number 
field  definition 


of  test 
cards) 


CLASSNAME  or  SUBCLASS  card 


END* 


(Control  cards) 


$DISPLAY 


This  execution 
of  DISPLAY 
will  result  in 
a performance 
summary  for 
test  fields. 


$END< 


*END* 


(Control  cards) 


C 


$DISPLAY 


This  execution  of 
DISPLAY  will  result 
in  a performance 
summary  for  training 
fields,  since  no 
test  fields  are 
input. 


(Front  of  dock) 


Figure  12-2.—  Concluded. 
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Figure  12-3.—  Sample  program  listing  for  the  DISPLAY  processor 
showing  use  of  F-distribution  thresholding. 
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Figure  12-4.-  Sample  program  listing  for  the  DISPLAY  processor  using  all  defaults. 
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Figure  12-5.-  Sample  program  listing  and  output  for  the  DISPLAY  processor  showing 

generation  of  the  ITS  summary  report. 
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Figure  12-5.—  Continued. 
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Figure  12-5.—  Continued 
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Figure  12-5.-  Continued 
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Figure  12-5 . — Continued . 
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Figure  12-5.—  Continued. 
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Figure  12-5.—  Continued 
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Figure  12-5.-  Continued. 
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Figure  12-5.—  Continued 
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Figure  12-5.—  Concluded 


13.  DATA-TRANSFORMATION  PROCESSOR  - DATA-TR 


13 . 1 PROCEDURES 

The  DATA-TR  processor  transforms  images  from  the  MSS  data  tape 
(DATAPE) . The  linear  transformation  is  performed  on  user-defined 
fields  according  to  the  following  formula: 

z * BST  + b (13-1) 

where 
k < 16 
n < 30 

B = a k-by-n  input  transformation  matrix  (see  section  3. 1.4. 2) 
x = an  n-by-1  data  vector 

b = a k-by-1  bias  vector  (see  BIAS  control  card,  table  13-1) 
z = a k-by-1  transformed  data  vector 

The  user  has  the  option  of  rescaling  the  transformed  data  via  the 
RESCALE  control  card.  (Control  cards  are  listed  in  table  13-1.) 
If  no  rescaling  is  performed,  it  is  assumed  that  the  data  can  be 
represented  in  eight  bits  (0  < z < 255) . For  rescaling  the  data 
to  be  within  the  range  of  0 to  255,  the  following  equation  is 
computed . 

x |MINi  - Zi|  (13-2) 

where 

MIN^  = minimum  value  for  component  i 
Z^  = transformed  data  point  for  channel  i 

R^  = range  of  component  i [MAX^  - MIN^] 

MAX^  = maximum  value  for  component  i 

= rescaled  transformed  data  point  for  channel  i 
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The  user  may  obtain  the  parameters  and  MIN^  in  one  of  three 
ways:  the  histogram  method,  the  statistical  method,  or  user 

input.  The  method  and  control  cards  associated  with  each 
method  are  defined  below. 


13.1.1  HISTOGRAM  (DEFAULT)  METHOD 


A histogram  of  a segment  of  the  transformed  image  is  performed 
to  find  the  R^  and  MIN^  for  each  component  of  the  transformed 
data.  If  the  user-defined  field  is  smaller  than  2000  pixels, 
all  pixels  are  used  in  the  histogram;  otherwise,  the  following 
formula  is  used  to  determine  the  line  and  sample  increments 
needed  to  obtain  2000  points  for  the  histogram. 


a 


/ MN  \1/2 
\ 2000/ 


(13-3) 


where 

M = number  of  samples 
N = number  of  lines 
a = increment  (integer) 

In  deriving  an  approximate  range  for  the  transformed  data,  the 
user  may  specify  a percentage  of  points  to  be  excluded  from 
the  upper  and  lower  tails  of  the  histogram  by  using  the  PEROUT 
control  card.  If  not  so  specified,  2.5  percent  of  the  points 
on  the  tails  are  excluded  when  determining  the  MAX^  and  MIN^ 
values  of  the  central  95  percent  of  the  transformed  data 
distribution. 

Optionally,  the  user  may  specify  the  maximum  expected  data  value 
for  each  channel  n of  the  input  data  vector  x.  Otherwise,  the 
maximum  data  value  for  each  channel  is  set  equal  to  255. 


13.1.2  STATISTICAL  METHOD 

Activated  by  the  RESCALE  and  MODULE  or  STATFILE  control  card,  the 
statistical  method  is  applied  for  deriving  an  approximate  MAX^ 
and  MIN^  value  for  each  component  i.  Using  the  subclass  statis- 
tics, an  approximate  R^  is  computed  using  equations  (13-4)  and 
(13-5) . 

Let  ai  = MAXj(sj  + kc^)  (13-4) 

end  6i  = MIN.(e?  - kc^)  (13-5) 

where 

i = l,***,m  components  of  Z 

j = l,’**,w  subclasses 

k = an  integer  specified  by  the  user  (see  LAM  control  card) 

= transformed  mean  of  the  i th  component  of  subclass  j 

/v  *i  ^ 

= standard  deviation  of  the  i th  component  of  subclass  j com- 
puted from  the  transformed  covariance  matrix  for  subclass  j . 

The  approximate  range  of  each  component  will  be 

? i = l,***,m  (13-6) 

and  MINi  = 6i  (13-7) 

. 2 55 

Letting  the  scale  factor  = -g— , i = 1, •••,m,  the  complete 

transformation,  including  rescaling,  to  be  performed  on  the 
original  image  (each  pixel)  is: 

Yi  = ei  x l«i  - (BST  + b.)  | (13-8) 

where 

= i th  element  of  the  bias  vector  b 
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Optional  control  cards  that  may  be  used  in  conjunction  with  the 
statistical  method  are  SUBCLA,  LAM,  PEROUT,  OPTION  ORIG,  and 
OPTION  TRANSF.  Their  functions,  as  well  as  those  of  other  con- 
trol cards,  are  described  in  table  13-1. 

13.1.3  USER  INPUT  METHOD 

The  user  may  input  his  own  scaling  parameters  via  the  OPTION 
SCAFAC  control  card  or  use  input  from  a previous  execution  of 
DATA-TR  in  which  the  computed  scaling  parameters  (255/R^,MIN^) 
were  punched  on  cards  via  the  OPTION  PUNCH  control  card.  When 
inputting  the  previously  executed  deck,  the  user  should  note 
that  the  values  punched  were  computed  using  the  MIN^  and  MAX^ 
determined  after  a certain  percentage  of  the  tails  of  the  histo- 
grams (see  PEROUT  control  card)  were  discarded. 

The  transformed/rescaled  data  are  output  in  either  the  Universal 
or  LARSYS  II  format.  The  option  is  controlled  by  the  FORMAT 
control  card. 

A line  printer  plot  of  the  histogram  (frequency  distribution)  of 
the  transformed  rescaled  data  is  printed.  If  applicable,  the  MAX 
and  MIN^  are  printed. 

13.2  INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input  to  the  DATA-TR  processor. 
The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may 
assign  any  available  logical  unit.  (See  table  4-1  for  file 
assignments  and  section  3.2,  MSS  Image  Data  Tapes,  for  further 
information  on  format,) 


13.3  OUTPUT  FILES 


The  transformed/rescaled  data  are  output  on  the  TRFORM  file, 
logical  unit  L (Fortran  unit  14) , in  either  the  Universal  or 
LARSYS  II  format. 

13.4  SCRATCH  FILES 

The  DATA-TR  processor  does  not  use  scratch  files. 

3.3. 5 CARD  INPUT 

All  system  card  input  forms  referred  to  in  this  section  are 
defined  in  section  3. 

13.5.1  PROCESSOR  CARD 

The  keyword  for  the  processor  card  is  left  justified  beginning 
in  column  1.  For  example, 

$DATA-TR 

This  keyword  directs  the  system  monitor  routine  to  select  the 
DATA-TR  processor  and  initiates  loading  of  routines  used  by 
DATA-TR. 


13.5.2  SPECIAL  SYSTEM  DECKS 

The  B-matrix  deck  discussed  in  section  3. 1.4. 2 must  be  input  to 
this  processor.  The  deck  may  be  obtained  from  a previous  exe- 
cution of  SELECT.  The  module  STAT  deck  (section  3,1.4. 1)  is 
optional  input.  If  input,  the  second  method  for  rescaling  the 
input  data  (given  in  section  13.1.2)  will  be  used. 


13.5.3  CONTROL  CARDS 


Table  13-1  lists  the  control  cards  and  available  options  for  the 
DATA-TR  processor. 
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13.5.4  FIELD  DEFINITIONS 


See  section  3.1.3  for  the  format  of  field  definition  cards.  At 
least  one  field  definition  card  must  immediately  follow  the  *END* 
control  card.  An  output  file  is  created  for  each  field  defini- 
tion input  and  is  written  on  unit  L.  Each  of  these  fields  con- 
sists of  a rectangular  field  which  surrounds  the  vertices  of 
the  input  field.  All  pixels  outside  the  input  field  and  within 
the  rectangular  output  field  are  set  equal  to  zero.  The  line 
and  sample  number  will  be  numbered  sequentially  from  1. 

13.5.5  DECK  SETUP 

The  deck  setup  for  the  DATA-TR  processor  is  given  in  figure  13-1. 


13.6  CARD  OUTPUT 

The  DATA-TR  processor,  via  the  OPTION  PUNCH  control  card, 
outputs  the  computed  scaling  parameters  on  cards.  Two  pairs  of 
scaling  parameters  are  punched  on  each  card;  i.e.,  each  punched 
card  contains  the  scaling  parameters  for  two  components  of  the 
transformed  data.  The  cards  must  be  used  in  the  same  order  as 
punched.  Their  formats  and  definitions  are  as  follows.  The 
number  of  cards  is  determined  by  the  number  of  components. 


Columns 

Type/ format 

Definition 

1-6 

A6 

OPTION 

11-17 

A7 

SCAFAC= 

18-27 

Al ,F9 . 3 ,F9 . 3 ,A1 

(CON1,MIN1) 

♦ 

where  CON^=255/R^,  R^  is  the  range  of 
component  1,  and  MIN^=minimum  value  for 

* 

component  1.  Parentheses  must  be  input. 

£ 

28-37 

Al ,F9 . 3 ,F9 . 3 ,A1 

(con2,min2) 

where  CON2=255/R2,  R2  is  the  range  of 

>■ 

component  2 and  MIN2=  minimum  value  for 

* V 

'*  P 

...I  ...  _ * 
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Columns  Type/format 


Definition 


component  2.  Additional  pairs  are  con- 
tinued on  succeeding  cards. 


13.7  SAMPLE  COMPUTER  RUNS 

Sample  computer  runs  are  given  in  figures  13-3,  13-4,  and  13-5. 

The  first  (fig.  13-3)  illustrates  the  situation  when  no  rescaling 
is  performed.  Output  (not  shown)  includes  a printout  of  the 
B-matrix,  the  definition  of  the  transformed  field,  and  a histogram 
of  the  transformed  data. 

Figure  13-4  illustrates  the  method  of  rescaling  the  transformed 
data  by  user  input  of  scaling  parameters  (OPTION  SCAFAC) . 

Figure  13-5  illustrates  the  method  of  rescaling  the  transformed 
data  by  the  statistical  method. 

13.8  RESTRICTIONS 

The  system-related  restrictions  in  section  17  apply  to  this 
processor. 

The  maximum  number  of  channels  allowed  is  30,  and  the  maximum 
number  of  components  in  the  transformed  vector  is  16. 

13. 9 DIAGNOSTIC  MESSAGES 

Message 

***  BAD  SUPERVISOR  CONTROL 
CARD  SETUPS  *** 

***  INVALID  CONTROL  CARD 
REJECTED  SETUP8  *** 

***  THE  CHANNEL  NUMBER  MUST 
LIE  BETWEEN  1-30  *** 


Explanation 

Check  spelling  of  keyword. 

Check  spelling  of  parameter. 
Incorrect  channel  number  input. 


Message 

***  BAD  FIELD  CARD  FROM 
LNTRAN  *** 

***  THE  NUMBER  OF  COMPONENTS 
IN  Y-VECTOR  TIMES  THE  NUMBER 
OF  SAMPLES  EXCEEDS  THE  SIZE  OF 
STORAGE  AREA  - TERMINATE  ***. 

SETREM  ERROR  - THERE  WERE 
XX  SCALE  FACTORS  AND  MINIMUM 
VALUES  INPUT  THROUGH  THE 
SCAFAC  OPTION.  YY  LINEAR 
COMBINATIONS  WERE  REQUESTED. 
THERE  MUST  BE  A SCALE  FACTOR 
AND  A MINIMUM  VALUE  FOR  EACH 
LINEAR  COMBINATION.  THE  PRO- 
GRAM WILL  TERMINATE  THROUGH 
CMERR. 


Explanation 

Check  format  of  field  card. 
Self-explanatory . 


This  message  indicates  that  the 
input  scaling  parameter  pairs 
are  not  in  one-to-one  corre- 
spondence with  the  number  of 
components  of  the  transformed 
data.  Too  many  or  too  few 
pairs  were  input. 


TABLE  13-1.-  DATA-TR  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


Parameter  and 
default  values 
(b) 


Function 


B-MATRIX  CARDS  or  FILE 
Default:  None 


CARDS  indicates  that  the 
B-matrix  is  on  cards  immedi- 
ately following.  FILE  indi- 
cates that  the  B-matrix  is  on 
file  and  initiates  input  of 
the  BMFILE. 


FORMAT  OUTPUT=UNIVERSAL 

Default:  LARSYS  II 

FORMAT  OUTPUT=LARSYS 

Default:  LARSYS  II 


The  transformed  data  will  be 
output  in  Universal  format. 

The  transformed  data  will  be 
output  in  LARSYS  II  format. 


RESCALE 


Blanks 

Default:  No  rescaling 


Initiates  rescaling  of  the 
transformed  data  to  the  range 
of  0 to  255. 


DATAF I LE  UNIT=N , F I LE=M 

Default:  N=3 ,M=1 


STATFILE  UNIT=N , FILE=M 

Default:  N=1,M=1 


N is  the  Fortran  logical  unit 
number  to  which  the  MSS  data 
tape  (DATAPE)  has  been  assigned; 
M is  the  file  number  on  the 
tape  to  be  processed. 

N is  the  Fortran  logical  unit 
number  to  which  the  SAVTAP 
file  has  been  assigned;  M is 

(1)  the  file  member  on  the 
tape  to  be  processed  or 

(2)  , if  a module  STAT  deck  is 
input,  the  number  of  the  file 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

b L 

The  parameter  and  default  values  are  in  card  columns  11  through  72 
in  any  column  past  10) . 
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(beginning 


TABLE  13-1.—  Continued 


Keyword 


OPTION 


OPTION 


MODULE 


MODULE 


SUBCLASS 


Parameter  and 
default  values 


PUNCH 

Default:  No  cards 

punched 

SCAFAC=(CON1,MIN1) , 

(con2,min2) , • • • , 

(CONi/MINi) 

Default:  Histogram 

method  of  rescaling 


FILE 

Default:  If  RESCALE 

is  input,  the  histo- 
gram method  is 
assumed. 

CARDS 

Default:  If  RESCALE 

is  input,  the  histo- 
gram method  is 
assumed. 


k<number  of  sub- 
classes on  SAVTAP<60 


Function 

on  which  to  store  the  training 
statistics.  If  M^l,  this  con- 
trol card  must  precede  the 
module  STAT  deck. 

Directs  the  program  to  punch 
the  scaling  parameters 
(CON^ ,MINi)  on  cards. 

CON  and  MIN  are  floating 
point  values  separated  by  a 
comma.  Blanks  between  the 
two  values  are  ignored.  The 
scaling  parameters  should  be 
ordered  according  to  the 
transformed  data  vector 
components . 

Initiates  reading  of  the 
SAVTAP  file;  if  rescaling  is 
performed , it  initiates  the 
statistical  method. 

Initiates  reading  of  the 
module  STAT  deck  that  must 
immediately  follow  this  card; 
if  rescaling  is  performed, 
it  initiates  the  statistical 
method . 

Integers  which  define  a sub- 
set of  subclasses  S^,S2 ,S2 , • • • 
from  the  input  statistics 


* 


% #' 


f 


Keyword 


LAM 


PEROUT 


MAXPT 


TABLE  13-1.—  Continued. 


Parameter  and 
default  values 

Default:  Statistics 

for  all  subclasses 
defined  are  used  in 
calculating  the 
scaling  factors. 

N 

Default:  N=2 


N 

Default:  N=5,  in 

which  case  5%  of  the 
total  distribution 
will  be  deleted  from 
both  the  upper  and 
lower  tails  of  the 
transformed  data  set. 


M1,M2,M3, • • • 
k<30 


Function 

(SAVTAP)  to  be  used  in  cal- 
culating the  scaling  factors 
and  approximating  R^. 


An  integer  multiplied  by  the 
standard  deviations  of  the 
input  subclass  statistics  to 
derive  an  approximate  range 
for  rescaling  the  transformed 
data. 

An  integer  which  specifies  the 
percentage  of  points  to  be 
deleted  from  the  upper  and 
lower  tails  of  the  trans- 
formed data  distribution  in 
computing  an  approximate 
range  for  rescaling.  For  the 
histogram  method  of  rescaling, 
N/2%  is  deleted  from  both  the 
upper  and  lower  tails  of  the 
histogram.  For  the  statisti- 
cal method  of  rescaling,  N% 
is  deleted  from  both  the  upper 
and  lower  tails  of  the 
histogram. 

Maximum  expected  value  of  MSS 
data  tape  (DATAPE)  input  for 
each  channel.  M's  are  integers 


Default:  255,255, 


TABLE  13-1.—  Continued. 


Keyword 


OPTION 


OPTION 


BIAS 


HEDl 


Parameter  and 
default  values 


ORIG 

Default:  No  sta- 

tistics printout 


Function 

used  in  deriving  an  approximate 
range  (MIN., MAX.)  of  the 
transformed  data  set  for  the 
histogram  method  of  rescaling. 

Initiates  the  printout  of  the 
original  (untransformed)  sta- 
tistics for  the  subclasses 
input  for  the  statistical 
rescaling  method. 


TRANSF 

Default:  No  sta- 

tistics printout 


bl'b2'b3'***'bk 

or 

N*bl ' bN+l ' * ' * ,bk 
k=number  of  components 

in  the  transformed 

data  set  and  N=an 

integer  repetition 

factor  for 

i<16 

Default:  b^=0.0 

Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

•JOHNSON  SPACE  CENTER 


Initiates  the  printout  of  the 
transformed  statistics. 


All  b's  are  decimal  (floating 
point)  numbers,  separated  by 
commas;  they  comprise  the  bias 
vector  to  be  applied  in  the 
transformation  of  the  input 
data  set:  Z=B(x+b) 


Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 


TABLE  13-1.—  Concluded 


Keyword 

Parameter  and 
default  values 

Function 

HED2 

Any  60  characters 

Replaces  second  header  line 

beginning 

in  column 

11 

with  the  indicated  characters 

Default: 

HOUSTON , 

in  the  parameter  field. 

TEXAS 

DATE 

Any  12  characters 

Prints  the  indicated  12  charac- 

beginning 

in  column 

11 

ters  in  the  right  corner  of 

Default: 

Current  date 

the  heading  in  place  of  the 

current  date. 

COMMENT 

Any  60  characters 

Prints  a comment  line  using 

beginning 

in  column 

11 

the  60  characters  found  in 

Default : 

No  comments 

the  parameter  field. 

printed 

♦END* 

Blank 

Signals  the  end  of  the  control 
cards . 

SEND* 

Blank 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 

Figure  13-1.—  Functional  flow  chart  for  the  DATA-TR  processor 
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Figure  13-2.—  Continued. 
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Figure  13-4.—  Sample  program  listing  and  output  for  the  DATA-TR  processor  of 

rescaling  by  input  of  the  scaling  parameters. 
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Figure  13-4.—  Concluded 
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Figure  13-5.—  Continued 
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14.  STATISTICS  TRANSFORMATION  PROCESSOR  - TRSTAT 


The  TRSTAT  processor  will  read  a SAVTAP  file  or  card  deck  gener- 
ated by  STAT  or  ISOCLS,  perform  a linear  transformation  on  the 
means  and  covariances,  and  output  the  transformed  statistics  on 
a new  file.  The  equation  for  the  linear  transformation  of  the 
means  is  as  follows: 

P'  = Ay  + b (14-1) 

where 

A = a k-by-n  matrix  (see  section  14.4.2);  k < 15  and  n i 30 
u = an  n-by-1  mean  vector 

b = a k-by-1  bias  vector  (see  card  type  4,  section  14.4.2) 
y ' as  a k-by-1  transformed  mean  vector 

The  equation  for  the  linear  transformation  of  the  covariances 
is  as  follows : 


K'  = AKAT  (14-2) 

where 

K = an  n-by-n  covariance  vector 

rp 

A = an  n-by-k  transpose  of  A 

K'  = a k-by-k  transformed  covariance  matrix 

14.1  INPUT  FILES 

A set  of  statistics  must  be  input  either  from  the  SAVTAP  file 
or  by  cards.  (See  STATFILE  or  MODULE  control  card,  table  14-1.) 

14.2  OUTPUT  FILES 

The  transformed  statistics  are  output  on  a file  in  the  SAVTAP 
format.  (See  section  4.1  and  STATFILE  control  card.) 


14.3  SCRATCH  FILES 


The  TRSTAT  processor  uses  no  scratch  files. 

14.4  CARD  INPUT 

All  system  formats  referred  to  in  this  section  are  defined  in 
sections  3 and  14.4.2. 

14.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$ TRSTAT 

This  card  directs  the  monitor  routine  to  execute  the  TRSTAT 
processor  and  initiates  loading  of  routines  used  by  TRSTAT. 

14.4.2  A-MATRIX  DECK 

The  A-matrix  deck  is  composed  of  a transformation  matrix  and  an 
additive  bias  vector.  Its  format  is  shown  below.  For  additional 
information  on  the  transformation  matrix,  see  section  3. 1.4. 2. 


Card  type 

Columns 

Type/format 

Definition 

1 

1-8 

Alphanumeric 

Keyword  A-MATRIX. 

2 

6-7 

Integer/12 

Number  of  linear  combinations. 

13-14 

Integer/12 

Number  of  channels. 

17-80 

Integer/12 

Actual  channels  used  in 
computation. a 

3 

6-20 

Real/E15 . 8 

Element  1 

21-35 

• 

• 

Real/E15 . 8 

Element  2 

• 

66-80 

Rea 1/El 5. 8 

• 

Element  5 of  A-matrix  — 
the  full  matrix  is  punched, 

aThe  channels 

in  the  B- 

matrix  described  in 

section  3. 1.4. 2 begin  in  column  18. 

Card  type  Columns 


Definition 


4 


Type/format 


6-20  Real/El 5. 8 

21-35  Real/El 5. 8 


5 values  per  card.  Addi- 
tional elements  are  contin- 
ued on  succeeding  cards. 

Element  la 
Element  2 


66—80  Real/E15.8  Element  N of  b-vector, 

5 values  per  card,  with 
additional  values  continued 
on  succeeding  cards;  N=number 
of  linear  combinations. 


14.4.3  CONTROL  CARDS 

Table  14-1  lists  the  control  cards  and  available  options  for 
the  TRSTAT  processor. 


14.4.4  FIELD  DEFINITIONS 

No  field  definition  cards  are  input  to  TRSTAT. 

14.4.5  DECK  SETUP 

The  deck  setup  for  the  TRSTAT  processor  is  given  in  figure  14-1. 

14.5  CARD  OUTPUT 

The  transformed  statistics  deck  will  be  output  in  the  same  format 
as  the  module  STAT  deck. 


aUnlike  the  B-matrix  described  in  section  3. 1.4. 2,  the  A-matrix  deck  contains 
the  additive  vector. 


3<i3 


14.6  SAMPLE  COMPUTER  RUNS 


Figures  14-2  and  14-3  are  sample  program  listings  and  output  from 
the  TRSTAT  processor. 

14.7  RESTRICTIONS 

The  system-related  restrictions  in  section  17  apply  to  this 
processor. 

The  maximum  dimension  of  the  A-MATRIX  is  15  by  30,  and  the 
maximum  number  of  elements  in  the  additive  b-vector  is  30. 


14.8  DIAGNOSTIC  MESSAGES 
Message 

***  BAD  SUPERVISOR  CONTROL 
CARD  SETUP 9 *** 

NUMBER  OF  CHANNELS  FROM  STAT 
FILE  DOES  NOT  EQUAL  THE  NUMBER 
OF  CHANNELS  ON  A-MATRIX  FILE. 

CHANNELS  ON  STAT  FILE  = 

CHANNELS  ON  A-MATRIX  = 

INVALID  CONTROL  CARD  REJECTED 
***  SETUP9  *** 


Explanation 

Invalid  control  card.  Check 
spelling  of  keyword. 

Self-explanatory . 


The  parameter  field  of  the 
control  card  is  in  error. 


V 


TABLE  14-1.-  TRSTAT  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

(a) 


A-MATRIX 


OPTION 


MODULE 


STATFILE 


Parameter  and 
default  values 

(b) Function 


Blank 

Default:  Deck  must 

be  input 


Initiates  input  of  the  A-matrix 
and  b-vector.  The  A-matrix 
deck  (see  section  14.4.2  for 
format)  immediately  follows 
this  card. 


P ,0  ,T  P punches  the  transformed  sta- 

tistics; 0 prints  the  original 
statistics;  and  T prints  the 
transformed  statistics. 


Blank 


INPUT/UNIT=N , FILE=M , 
OUTPUT/UNIT=L , FILE=S 
Default:  N=l;  M=l; 

L=9 ; S=1 


Initiates  input  of  the  module 
STAT  deck,  which  immediately 
follows  this  card.  (See 
section  3. 1.4.1  for  module 
STAT  deck  format.) 

N is  the  Fortran  logical  unit 
number  to  which  the  tape  con- 
taining the  statistics  to  be 
transformed  has  been  assigned; 
M is  the  number  of  the  file  to 
be  processed;  L is  the  Fortran 
logical  unit  number  to  which 
the  transformed  statistics  are 
to  be  output;  and  S is  the 
number  of  the  next  file  to  be 
created  on  output  SAVTAP  file. 


aThe  keyword  must  be  left  justified  ir.  card  columns  1 through  10. 

^The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 
in  any  column  past  10) . 


TABLE  14-1.—  Continued. 


Keyword 

CHANNELS 


SUBCLASS 


HED1 


HED2 


DATE 


COMMENT 


Parameter  and 
default  values 

N1,N2,N3,--,Nk 
k=number  of  matrix 
channels<30 
Default:  None 


S1'S2'S3' 


/ S. 


k<number  of  subclasses 
on  SAVTAP<60 
Default:  Statistics 

for  all  subclasses 
defined 

Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON, 

TEXAS 

Any  12  characters 
beginning  in  column  11 
Default : Current  date 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 


Function 

N's  are  integer  channel  numbers 
referring  to  the  SAVTAP  file. 

The  number  of  channels  requested 
from  SAVTAP  must  be  equal  to  the 
number  of  channels  on  the 
A-matrix  file. 

Transforms  statistics  for  only 
subclasses  S3,S2 ,S3 , • • • . 


Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Prints  the  indicated  12  charac- 
ters in  the  right  corner  of  the 
heading  in  place  of  the  cur- 
rent date. 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 


TABLE  14-1.—  Concluded 


Keyword 

*END* 

$END* 


Parameter  and 
default  values 

Blank 


Blank 


Function 

Signals  the  end  of  the  control 
cards. 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 


(Front  of  dock) 


Figure  14-1. - Deck  setup  for  executing  the  TRSTAT  processor 

with  the  module  STAT  deck. 
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Figure  14-2.-  Continued 
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Figure  14-2.—  Continued 
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Figure  14-2  *—  Continued 
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Figure  14-2.—  Continued 
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Figure  14-3.—  Sample  program  listing  and  output  for  the  TRSTAT  processor  using 
options  and  executing  back  to  back  with  ISOCLS. 


15.  N-DIMENSIONAL  HISTOGRAM  PROCESSOR  - NDHIST 


The  NDHIST  processor  computes  an  n-dimensional  histogram  of  areas 
of  the  MSS  data  tape  (DATAPE)  of  which  the  user  has  requested 
scatter  plots.  The  pixel  dimensions  are  user  specified  by  the 
plotting  channels,  and  the  histogrammed  pixels  are  output  on  the 
NDIM  file.  The  file  is  written  as  an  interface  to  the  SCTPRL 
processor. 

15.1  PROCEDURES 

The  number  of  channels  (dimensions)  used  in  histogramming  is 
specified  by  means  of  the  CHANNELS  control  card  defined  in 
table  15-1.  The  plotting  channels  are  the  primary  input  channels. 
The  color  channels  are  for  further  delineation  of  the  frequency 
determination.  Thus,  the  total  dimensionality  n of  the  histo- 
gramming is  the  number  of  plotting  channels  plus  the  number  of 
color  channels. 

In  the  case  of  n > 2,  the  dimensionality  is  reduced  to  2 in  the 
SCTRPL  processor  by  means  of  a linear  transformation. 

Composed  of  the  plotting  channels,  each  unique  (positional)  pixel 
within  the  field  designates  the  position  of  a pixel  on  the  scatter 
plot  tape,  PLOTAP.  The  frequency  of  each  positional  pixel  is 
determined  as  a function  of  the  color  channels  (if  input)  and  the 
plotting  channels. 

If  only  plotting  channels  are  input,  both  the  positional  pixel 
and  its  frequency  are  calculated  using  plotting  channel  data. 

If  both  plotting  and  color  channels  are  input,  the  positional 
pixel  is  a function  of  the  plotting  channels,  and  the  frequency 
is  a function  of  both  sets  of  channels  as  follows: 

a.  If,  for  each  positional  pixel  x,  there  is  only  one  unique 


color  pixel  y,  f(x,y)  = f(x,y)  + 1. 


b.  If,  for  each  positional  pixel  x,  there  are  two  or  more  color 
pixels  y^,  i = l,2,3,»*»,n,  and  n < total  pixels  in  scene, 
f(x»yi)  = f(x,yj)  + 1 only  if  f(xryi)  = f(x,yj)  for  i $ j, 

c.  If,  for  two  or  more  positional  pixels  xif  i = 1,2,3, • • • ,n, 
and  n < total  pixels  in  the  scene,  there  is  only  one  unique 
color  pixel  y,  f(xi#y)  = f(Xj,y)  + 1 only  if  f(xi,y)  = f(x,.,y) 
for  i fi  j . 

The  color  assignment  for  each  unique  pixel  may  be  optionally  set 
by  the  NDHIST  or  the  SCTRPL  processor.  If  applicable,  the  color 
codes  are  output  on  the  NDIM  file.  The  color  codes  may  be  set 
using  the  following  information. 

a.  The  original  radiance  value  of  the  pixel  (see  CHANNELS  con- 
trol card,  table  15-1) . 

b.  The  mean  value  of  the  cluster  or  subclass  to  which  the  pixel 
was  assigned  during  clustering  or  classification.  In  exer- 
cising this  option,  the  user  must  input  a classification  or 
cluster  map  (see  MAPFIL  control  card,  table  15-1)  to  this 
processor.  To  execute  the  SCTRPL  processor,  a SAVTAP  file 
related  to  the  MAPFIL  must  be  input  (see  CHANNELS  and  STATFILE 
control  cards,  section  16,  table  16-1) . The  subclass  or 
cluster  numbers  assigned  to  the  pixel  during  classification 

or  clustering  are  stored  on  the  NDIM  file,  passed  to  the 
SCTRPL  processor,  and  used  for  retrieving  the  means  from  the 
SAVTAP  file. 

c.  The  mean  value  of  the  test  or  training  field  from  which  the 
pixel  was  extracted. 

d.  User-defined  colors  (see  COLOR  control  card,  table  16-1)  . 

e.  From  any  pass  on  the  MSS  data  tape  when  using  multiregistered 
Landsat  data  (see  CHANNELS  control  card,  table  15-1)  . 


The  areas  selected  for  histogramming  are  defined  by  test  and/or 
training  fields.  The  manner  in  which  the  fields  are  collected 
or  grouped  for  histogramming  is  user  controlled  by  input  param- 
eters. The  data  vectors  may  be  histogrammed  collectively  on  a 
class,  subclass,  or  per-field  level.  The  maximum  number  of 
fields  input  on  any  level  is  200,  and  the  maximum  number  of 
unique  data  vectors  accumulated  on  *my  level  is  12  000  divided  by 
one-fourth  the  number  of  plotting  channels. 

A functional  flow  diagram  of  the  NDHIST  processor  is  given  in 
figure  15-1. 

15.2  INPUT  FILES 

An  MSS  data  tape  (DATAPE)  must  be  input  to  the  NDHIST  processor. 

The  tape  assignment  defaults  to  logical  unit  C (Fortran  unit  3) ; 
however,  by  input  of  the  DATAFILE  control  card,  the  user  may  assign 
any  available  logical  unit.  (See  table  4-1  for  file  assignments 
and  section  3.2,  MSS  Image  Data  Tapes,  for  further  information.) 
Optionally,  a classification  or  cluster  MAPFIL  tape  may  be  input 
(see  MAPFIL  control  card) . 

15 . 3 OUTPUT  FILES 

A multifile  tape  is  always  output  (NDIM  file) . It  is  an  inter- 
face to  the  SCTRPL  processor  and  must  be  assigned  to  tape  or 
Fastrand.  No  file-skipping  capability  is  available;  the  first 
file  created  is  always  file  1.  (See  HISFIL  control  card, 
table  15-1,  and  appendix  H.) 

15.4  SCRATCH  FILES 

The  NDHIST  processor  dynamically  assigns  random  access  drum 
storage  for  the  histogram  counters,  color  codes,  identification 
information,  and  (optionally)  the  pixel  assignment  from  the 
classified  or  clustered  image  tape  (MAPFIL) . 


15.5  CARD  INPUT 


15.5.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$NDHZST 

This  card  directs  the  system  monitor  routine  to  select  the  NDHIST 
processor  and  initiates  loading  of  all  the  NDHIST  routines  into 
the  system. 

15.5.2  SYSTEM  CARD  DECKS 

No  special  system  card  decks  are  required  for  the  NDHIST 
processor. 

15.5.3  CONTROL  CARDS 

Table  15-1  lists  the  control  cards  and  available  options  for 
the  NDHIST  processor. 

15.5.4  FIELD  DEFINITIONS 

The  field  cards,  which  immediately  follow  the  *END*  control  card, 
define  the  areas  to  be  histogrammed , and  the  OPTION  control  card 
determines  the  level  of  histogramming . The  fields  may  be  ordered 
in  one  of  four  ways: 

a.  As  input  to  STAT  (section  8.4.4) 

b.  As  input  to  ISOCLS  (section  9.5.4) 

c.  As  input  to  CLASSIFY  (section  11.5.4) 

d.  As  a user-defined  field  (section  3.1.3) 


3*7 


1 


8 


For  example: 

*END* 

CLASSNAME  WHEAT 

SUBCLASS  WHEAT1 

(Field  card  1) 
(Field  card  2) 
SUBCLASS  WHEAT 2 

(Field  card  3) 
CLASSNAME  NONWH 

SUBCLASS  NONWH 1 

(Field  card  4) 
SUBCLASS  NONWH 2 

(Field  card  5) 
SUBCLASS  NONWH 3 

(Field  card  6) 
(Field  card  7) 

$END* 


If  the  histogram  is  accumulated  on  class  bases,  fields  1,  2,  and 
3 are  histogrammed  collectively  and  output  as  data  file  1;  and 
fields  4,  5,  6,  and  7 are  histogrammed  collectively  and  output 
as  data  file  2. 


If  the  histogram  is  accumulated  on  subclass  bases,  fields  1 and 
2 are  histogrammed  collectively  and  output  as  data  file  1;  field  3 
is  histogrammed  and  output  as  data  file  2;  field  4 is  histogrammed 
and  output  as  data  file  3;  field  5 is  histogrammed  and  output  as 
data  file  4;  and  fields  6 and  7 are  histogrammed  collectively  and 
output  as  data  file  5. 

If  the  histogram  is  performed  on  per-field  bases,  each  field  is 
histogrammed  separately  and  output  to  a file,  making  a total  of 
seven  data  files  created. 

On  a cumulative  histogram,  a maximum  of  200  fields  may  be  input. 
See  section  3.1.3  for  format  of  the  field  definition  card. 


15.5.5  DECK  SETUP 

The  deck  setup  for  the  NDHIST  processor  is  given  in  figure  15-2. 


15.6  CARD  OUTPUT 


The  NDHIST  processor  does  not  provide  punched  card  output. 

15.7  SAMPLE  COMPUTER  RUNS 

Sample  runs  are  given  in  figures  15-3  and  15-4  and  appendix  H. 


15.8  RESTRICTIONS 


The  system-related  restrictions  in  section  17  apply  to  this 
processor.  Other  restrictions  are  as  follows. 

e A maximum  of  16  channels  may  be  histogrammed. 

e A maximum  of  4 channels  may  be  used  for  color  codes. 

• The  maximum  of  unique  vectors  to  be  histogrammed  is 


< 12  000 

- l/4 (number  of  channels) 


(15-1) 


• A maximum  of  4000  words  of  storage  is  allowed  for  storing  the 
MSS  data.  The  equation  for  computing  maximum  number  of 
pixels  is 


< 4000 ^ 

~ (number  of  channels (number  of  samples  per  scan  line) ] 


(15-2) 


15.9  DIAGNOSTIC  MESSAGES 
Message 


Explanation 


INVALID  CONTROL  CARD  - IGNORED.  Check  spelling  of  keyword. 

ERROR  ON  CHANNELS  CARD.  Check  parameter  field  of 

CHANNELS  control  card. 

ERROR  ON  DATA  FILE  CARD.  Check  parameter  field  of 

DATAFILE  control  card. 


ERROR  ON  MAP  FILE  CARD.  Check  parameter  field  of 

MAPFIL  control  card. 


^ 


Message 

^ p ERROR  ON  N-DIM  HISTOGRAM  FILE 
CARD. 

ERROR  ON  OPTION  CARD. 

— VECTORS  WERE  NOT  HISTO- 
G RAMMED,  BUT  USED  IN  COMPUTING 
FIELD  MEANS,  IF  APPLICABLE. 

I*  CORE  LIMITS  EXCEEDED.  MAXIMUM 
NO.  OF  VECTORS  ACCEPTED  IS  

TOO  MUCH  DATA  REQUESTED. 

REDUCE  NO.  OF  SAMPLES  PER  SCAN 
LINE  AND/OR  NO.  OF  CHANNELS. 

NOT  ENOUGH  DRUM  SPACE  TO  STORE 
MAP  TAPE  DATA. 


Explanation 

Check  parameter  field  of 
HISFIL  control  card. 

Check  parameter  field  of 
OPTION  control  card. 

The  histogrammed  vector  table 
is  full.  N number  of  unique 
vectors  were  not  histogrammed. 

Self -explanatory . 
Self-explanatory . 


Reduce  amount  of  data  being 
processed . 


t 

(# 


ilTllMfal— I 


...  * .1  ... 


TABLE  15-1.-  NDHIST  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 
[All  m and  n are  integers . ] 


i 


f 

X ' f 


Keyword 

(a) 


Parameter  and 
default  values 
(b) 


CHANNELS  PLOT^n^ , n2 , n 3 , • • • , ni , 

COLOR=m1,m2#m3,  • • • ,m^ 
Default:  None 


Function 

The  n 1 s are  the  channels  for 
determining  the  position  (PLOT) 
of  the  pixels  to  be  output  on 
the  PLOTAP  tape  (i<16) . If 
i=2 , n^  defines  the  sample 
location  and  ^ the  line 
location  on  the  scatter  plot 
tape  (PLOTAP) . If  i>2 , the 
pixels  will  be  transformed  to 
two  components  in  the  SCTRPL 
processor;  component  1 will 
define  the  sample  location 
and  component  2 the  line  loca- 
tion on  the  scatter  plot  tape 
(PLOTAP).  The  m's  represent 
the  channels  for  the  color 
codes  < j <4 ) . If  the  COLOR 
channels  are  input,  the  histo- 
gram is  a function  of  both  the 
PLOT  and  COLOR  channels;  if 
the  COLOR  channels  are  omitted 
the  histogram  is  a function  of 
only  the  PLOT  channels.  (See 
section  16  for  further 
information . ) 


% 


% 


aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

bThe  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning  c 

in  any  column  past  10) . 

■*  V 

f. 

* 

9U  . 

m 


TABLE  15-1.—  Continued 


Keyword 

Parameter  and 
default  values 

Function 

DATAFILE 

UNIT=N , FILE=M 

N is  the  logical  unit  number 

Default: 

N=3,M=1 

assigned  to  the  MSS  data  tape 
(DATAPE) ; M is  the  file  number 
of  the  data  to  be  processed. 

MAPFILE 

UNIT=N , FILE=M 

N is  the  logical  unit  number 

Default : 

None 

assigned  to  the  MAPFIL  tape; 

M is  the  file  number  of  the 
data  to  be  processed.  (The 
order  of  the  fields  to  be  his- 
togrammed  must  correspond  to 
the  order  of  the  clustered  or 
classified  fields  on  the  input 
MAPFIL  tape.) 

HISFIL 

UNIT=N 

The  logical  unit  number 

Default: 

il 

2 

assigned  to  the  NDIM  file. 

OPTION 

CLASS 

Fields  will  be  histogrammed 

Default : 

Field  bases 

on  class  bases. 

OPTION 

SUBCLS 

Fields  will  be  histogrammed 

Default: 

Field  bases 

on  subclass  bases. 

OPTION 

FIELD 

Fields  will  be  histogrammed 

Default: 

Field  bases 

on  per-field  bases. 

OPTION 

MEANS 

The  means  of  each  field  will  be 

computed  for  the  COLOR  channels 
on  the  CHANNELS  card  and  output 
on  the  NDIM  file. 


i 


TABLE  15-1.— 

Concluded. 

Keyword 

Parameter  and 
default  values 

Function 

HED1 

Any  60  characters 
beginning  in  column  11 
Default:  LYNDON  B. 

JOHNSON  SPACE  CENTER 

Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

HED2 

Any  60  characters 
beginning  in  column  11 
Default:  HOUSTON, 

TEXAS 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

DATE 

Any  12  characters 
beginning  in  column  11 
Default:  Current  date 

Prints  the  indicated  12  charac- 
ters in  the  right  corner  of 
the  heading  in  place  of  the 
current  date. 

COMMENT 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 

Prints  a comment  line  using 
the  60  characters  found  in 
the  parameter  field. 

♦END* 

Blank 

Signals  the  end  of  the  control 
cards. 

$END* 

Blank 

Signals  the  end  of  all  card 
input  for  the  processing 
function. 

(Control  cards) 


0 XQT  LARSAA 


0 XQT  CUR 


@ ASG  Z« (program  tape  number) 


@ ASG  N*=  (MAPFIL  file  number) 


I § ASG  E= (PLOTAP  file  number) 

/ 7 Saved  from  a previous 

( § ASG  A= (SAVTAP  file  number)  [execution  of  the  STAT 
) processor 

@ ASG  C= (MSS  DATAPE  number) 


@ ASG  D,H 


Assigns  NDIM  file  and 
BMFILE  to  Fastrand 


RUN  — 


( Front  of  dock) 


(b)  For  execution  back  to  back  with  SCTRPL, 
Figure  15-2.—  Continued. 


3 ' 


« ..t.. 


k of  dock) 


i 


Pixel  color  assignment 
will  be  extracted  from 
SAVTAP  file. 

:els  on  NDIM  file  will 
transformed . 


;d. 


i 

i 

J 

\ 


Figure  15-3.—  Sample  program  listing  and  output  for  the  NDHIST  processor  using 
defaults  and  executing  back  to  back  with  the  SCTRPL  processor. 


Figure  15-3.—  Continued. 


Figure  15-3.—  Continued 
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Figure  15-3.—  Continued. 
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Figure  15-3.-  Continued. 
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Figure  15-4.-  Sample  program  listing  for  the  NDHIST  processor  using  options. 


16.  SCATTER  PLOT  PROCESSOR  - SCTRPL 


The  SCTRPL  processor  reads  the  NDIM  file  written  by  the  NDHIST 
processor,  determines  the  location  of  each  unique  data  vector  on 
the  scatter  plot,  and  outputs  the  spectral  plot  in  the  Universal 
format.  A scatter  plot  is  created  and  output  to  tape  for  each 
file  stored  on  NDIM. 

The  location  (line  and  sample  intersection)  of  each  pixel  on  the 
two-axis  scatter  plot  will  be  computed  using  either  the  radiance 
values  or  two  linear  combinations  of  radiance  values.  (This 
option  is  controlled  in  section  15  by  the  CHANNELS  control  card, 
table  15-1.) 

If  the  data  vector  is  to  be  transformed  (see  B-MATRIX  control 
card,  table  16-1) , the  following  equation  and  conditions  will 
be  applied: 

y = Bx  + c (16-1) 

where 

y ■-  a 2-by-l  vector 
B = a 2-by-n  vector,  n < 16 
x = an  n-by-1  vector,  n < 16 
c = a 2-by-l  vector 

If  the  transformed  data  are  to  be  rescaled  (see  SCALE  control 
card,  table  16-1),  the  following  equation  will  be  applied: 

(HI.  - L0.\ 

-^rT M x |MIN.  - Zi|  (16-2) 

where 

HI^  = an  input  parameter  for  the  upper  rescale  limit  for 
channel  i 


LO^  = an  input  parameter  for  the  lower  rescale  limit  for 
channel  i 

MIN^  = minimum  value  for  channel  i 

= transformed  data  point  for  channel  i 

= range  (MAX^  - MIN^)  for  channel  i 

Y.  = rescaled  transformed  data  value  for  channel  i 
r 

MAX^  = maximum  value  for  channel  i 

The  scatter  plot  is  created  and  output  line  by  line  to  tape.  All 
the  pixels  belonging  to  a line,  as  determined  by  the  second  coor- 
dinate of  the  pixel,  are  collected;  and,  in  the  sample  location 
determined  by  the  first  coordinate,  the  color  assignment  and 
frequency  of  occurrence  of  each  pixel  are  output  to  tape  as 
channels  1 through  n,  respectively.  [See  procedures  for  NDHIST 
processor  (section  15.1)  for  definition  of  color  assignments,] 

The  dimensions  of  the  output  tape  are  user  controlled  by  the 
following  input  control  cards,  which  are  defined  in  greater 
detail  in  table  16-1  (the  control  cards  for  the  SCTRPL  processor) 
and  table  15-1  (the  control  cards  for  the  NDHIST  processor) . 

Keyword  Parameters 

SIZE  XLOW=0,XHIGH=128,XSIZ=129,YLOW=0, 

YHIGH=64,YSIZ=65 

CHANNELS  PLOT=  3,4, COLOR=  5 , 6 , 7 , 8 

The  output  tape  will  contain: 

a.  129  samples  per  scan  line  with  a maximum  data  resolution  of 
128  on  channel  3. 

b.  65  lines  per  file  with  a maximum  data  resolution  of  64  on 
channel  4. 


c.  cj  channel.4!  with  channels  1 through  4 containing  the  color 
pixel  (determined  by  channels  5,  6,  7,  and  8 on  CHANNELS 
control  card)  and  channel  5 the  frequency. 

d.  A color  pixel  for  each  positional  pixel  determined  by 
channels  3 and  4 on  the  CHANNELS  control  card. 

If  a MAPFIL  tape  containing  the  subclass  or  cluster  numbers  is 
input  to  the  NDHIST  processor,  either  a SAVTAP  file  related  to 
the  MAPFIL  tape  must  be  input  (see  STATFILE  control  card)  or 
the  user  must  input  the  color  codes  on  cards  (see  COLOR  control 
card) . 

Optionally,  a line  printer  pixel  frequency  scatter  plot  will  be 
output  (see  PIXPLT  control  card) . The  frequency  of  occurrence 
or  log  of  frequency  of  occurrence  will  be  represented  by  a symbol 
(see  SYMBOL  control  card) . The  location  of  the  symbol  on  the 
plot  will  be  determined  by  the  radiance  value  of  the  pixel.  If 
the  data  have  been  transformed,  then  the  data  must  be  rescaled 
to  exercise  this  option. 

A functional  flow  diagram  of  the  SCTRPL  processor  is  given  in 
figure  16-1. 

16.1  INPUT  FILES 

The  NDIM  file  created  by  NDHIST  must  be  input.  (See  the  HISFIL 
control  card  and  appendix  D for  format  of  the  NDIM  tape.) 

The  SAVTAP  file  created  by  the  STAT  or  ISOCLS  processor  option- 
ally may  be  input.  (See  the  STATFILE  control  card  and  sec- 
tion 4.1  for  a description  of  the  file.) 


16.2  OUTPUT  FILES 


A multifile  Universal-formatted  tape  containing  the  spectral 
plots  and  color  keys,  when  applicable,  will  be  output.  (See  the 
PLOTAP  control  card  and  appendix  F for  tape  format.) 

16.3  SCRATCH  FILES 

The  program  dynamically  assigns  random  access  drum  storage  for 
scratch  files. 

16.4  CARD  INPUT 

16.4.1  PROCESSOR  CARD 

The  processor  keyword  is  left  justified  starting  in  column  1. 

For  example, 

$SCTRPL 

This  card  directs  the  system  monitor  routine  to  select  the 
SCTRPL  processor  and  initiates  loading  of  all  the  SCTRPL  rou- 
tines into  the  system. 

16.4.2  SYSTEM  CARD  DECKS 

The  module  STAT  and  B-matrix  decks  may  be  input.  See  section  3 
for  formats. 

16.4.3  CONTROL  CARDS 

Table  16-1  lists  the  options  and  control  cards  for  the  SCTRPL 
processor. 

16 .4.4  FIELD  DEFINITIONS 

Field  definitions  do  not  apply  to  this  processor. 

16.4.5  DECK  SETUP 

Figure  16-2  is  a sample  deck  setup  for  the  SCTRPL  processor. 


16.5  CARD  OUTPUT 


The  SCTRPL  processor  does  not  provide  punched  card  output. 

16.6  SAMPLE  COMPUTER  RUNS 

Figure  16-3  is  a sample  computer  listing  illustrating  the  use 
of  all  defaults. 

Figure  16-4  is  an  example  of  executing  various  processors  back 
to  back  with  SCTRPL.  The  end  result  is  to  output  the  transformed 
cluster  means  as  the  color  code  for  each  pixel  output  to  the 
PLOTAP  file. 

16.7  RESTRICTIONS 

The  system-related  restrictions  in  section  17  apply  to  this 
processor. 

If  the  color  codes  for  the  scatter  plot  tape  PLOTAP  are  to  be 
principal  component  (PC)  colors,  the  user  must  ensure  that  the 
values  are  positive. 

The  maximum  dimension  of  the  B-matrix  is  2 by  16;  the  maximum 
number  of  elements  in  additive  vector  b is  16. 

The  maximum  number  of  channels  on  the  output  tape  PLOTAP  is  5, 
Color  codes  are  the  first  n-1  channels;  the  frequency  is  the  nth 
channel. 


The  maximum  number  of  channels  selected  from  the  SAVTAP  file  is  4. 

The  maximum  size  of  the  output  tape  PLOTAP  is  200  samples  per  scan 
line  and  200  lines. 


16.8  DIAGNOSTIC  MESSAGES 


Message 


Explanation 


INVALID  CONTROL  CARD  — 
IGNORED. 

ERROR  ON  CHANNELS  CARD. 

ERROR  ON  STAT  FILE  CARD. 

ERROR  ON  NDIM  HISTOGRAM  FILE 
CARD. 


Check  spelling  of  keyword. 

Check  parameter  field  of 
CHANNELS  card. 

Check  parameter  field  of 
STATFILE  card. 

Check  parameter  field  of  HISFIL 
card. 


ERROR  ON  OPTION  CARD. 

ERROR  ON  TAPE  3IZE  CARD. 

ERROR  ON  SCATTER  PLOT  TAPE 
CARD. 

ERROR  ON  SCALING  CARD. 


Check  parameter 
card. 

Check  parameter 
card. 

Check  parameter 
PLOTAP  card. 

Check  parameter 
card. 


field  of  OPTION 
field  of  SIZE 
field  of 
field  of  SCALE 


DATA  MUST  BE  RESCALED  BEFORE 
PIXEL  FREQUENCY  PLOT  OPTION  MAY 
BE  SELECTED . 

NO.  OF  PLOTTING  CHANNELS,  NO. 

OF  B-MATRIX  CHANNELS  MUST  BE 

EQUAL.  CHANNELS  ARE  , 

RESPECTIVELY. 

ERROR  IN  POSITIONING  N-DIM 
HIST  FILE  TO  FILE 


Transformed  data  must  be 
rescaled  for  line  printer  plot. 

Number  of  channels  to  be  trans- 
formed, must  equal  the  number 
of  channels  in  transformation 
matrix. 

Physical  tape  error  occurred. 
Resubmit  run. 


Message  Explanation 

f A TOTAL  OF  POINTS  WERE  Data  may  be  rescaled  to  a 

NOT  DISPLAYED  ON  THE  LINE  resolution  of  100. 

PRINTER  GRAPH.  THE  POINTS  WERE 
OUT  OF  RANGE  IN  EITHER  THE 
X DIRECTION  OR  Y DIRECTION. 


¥■/*>— 


TABLE  16-1.-  SCTRPL  PROCESSOR  OPTIONS  AND  CONTROL  CARDS 


Keyword 

Parameter  and 
default  values 

(a) 

(b) 

Function 

CHANNELS 

nl'n2'n3' * “ 'ni 
i<number  of  channels 

on  SAVTAP <30 

Default:  First 

4 channels  from  NDIM 

file 

Statistics  for  these  channels 
will  be  extracted  from  the 
SAVTAP  file?  n^  must  be  a sub- 
set of  channels  on  the  SAVTAP 
file . 

STATFILE 

UNIT=N , FILE=M 
Default:  None 

N is  the  logical  unit  nusnber 
assigned  to  the  SAVTAP  file? 
M is  the  number  of  the  file 
to  be  processed. 

HISFIL 

UNIT=N 

Default:  N=4 

N is  the  logical  unit  number 
assigned  to  the  NDIM  file. 

PIXPLT 

LOG 

Line  printer  pixel  scatter 
plot  of  the  log  of  frequency 
of  occurrence  will  be  printed. 

PIXPLT 

FREQ 

Line  printer  pixel  scatter 
plot  of  the  frequency  of 
occurrence  will  be  printed. 

PIXPLT 

RESCALE 
Default:  No 

rescaling.  XSIZ=101, 
YSIZ=101;  the  range 
for  x-axis  is  XLOW+ 
XSIZ-1?  the  range  for 
y-axis  is  YLOW+YSIZ-1. 

The  frequency  of  occurrence 
of  the  pixel  for  the  line 
printer  scatter  plot  will  be 
rescaled  to  ranges  XLOW,  XHIGH 
YLOW,  and  YHIGH . XSIZ  will 
determine  the  number  of  bins 
on  the  x-axis?  YSIZ,  the 

aThe  keyword  must  be  left  justified  in  card  columns  1 through  10. 

b ' 

The  parameter  and  default  values  are  in  card  columns  11  through  72  (beginning 

in  any  column  past  10) . 


TABLE  16-1.—  Continued 


Keyword 


COLOR 


SIZE 


SIZE 


SIZE 


SIZE 


SIZE 


SIZE 


Parameter  and 
default  values 


(n^) , (m2) , • • • , (mp) 
or 

L*(m1),  K*(mL+1) 

L and  K are  integer 
repetition  factors. 
Default:  No  user 

input  of  color  codes 


XSIZ=N 

Default: 

YSIZ=N 

Default: 

XHIGH=N 
Default : 

XLOW=N 
Default : 

YHIGH=N 

Default: 

YLOW=N 

Default: 


XSIZ=101 


YSIZ=101 


XHIGH=100 


X LOW-O 


YHIGH=100 


YLOW=0 


Function 

number  of  bins  on  the  y-axis. 
(See  SIZE  control  cards.) 

ml=nl'n2'  ’ * * 'nfi.  *s  color 
assignment  for  cluster  1; 

m2=n1, n2 , • • • ,n^  is  the  color 

assignment  for  cluster  2; 

m =n^#n2, • • • ,n^  is  the  color 

assignment  for  cluster  n. 

p<60  and  S,<4;  0<n^<255. 

The  number  of  samples  per  line 
to  output  on  the  scatter  plot 
tape;  N<200. 

The  number  of  lines  to  output 
on  the  scatter  plot  tape; 
N<200. 

The  upper  limit  of  the  radi- 
ance value  for  the  sample  axis 
(x-axis) ; N<255. 

The  lower  limit  of  the  radi- 
ance value  for  the  sample  axis 
(x-axis);  0<N<XHIGH. 

The  upper  limit  of  the  radi- 
ance value  for  the  line  axis 
(y-axis);  N<255. 

The  lower  limit  of  the  radi- 
ance value  for  the  line  axis 
(y-axis);  0<N<255. 


TABLE  16-1. ~ Continued 


Parameter  and 


Keyword 

default  values 

Function 

PLOTAP 

UNIT=N 
Default : 

N=7 

N is  the  logical  unit  number 
assigned  to  the  spectral  plot 
tape . 

B-MATRIX 

CARDS 
Default : 

None 

The  B-matrix  is  being  input  by 
cards . 

B-MATRIX 

FILE 
Default : 

None 

The  B-matrix  is  being  input  by 
file . 

BVEC 

T . T • • • T 

xl'  2'  ' n 

n=number  of  linear 

combinations  in 

B-matrix<2 

Default:  T =0.0 

n 

Elements  of  the  additive 
vector  to  be  used  in  the 
transformation;  T is  a 
floating-point  number. 

BCKGND 

N 

Default : 

N=255 

If  N=0 , background  will  be 
black;  if  N=255,  background 
will  be  white. 

SCALE 

FILE 

The  scale  factors  will  be  com- 
puted from  the  NDIM  file. 

SCALE 

XMAX=T 
Default : 
computed 
file.c 

XMAX  will  be 
from  the  NDIM 

The  upper  range  for  the  trans- 
formation of  the  sample  values 
(x-axis) ; T is  a floating- 
point number. 

SCALE 

XMIN=T 

The  lower  range  for  the  trans- 

Default:  XMIN  will  be 
computed  from  the  NDIM 
file.0 


formation  of  the  sample  values 
(x-axis) ; T is  a floating- 
point number. 


'If  one  of  the  parameters  XMIN,  XMAX,  YMIN,  or  YMAX  is  input,  all  four  param- 
eters must  be  input. 


TABLE  16-1 


Continued 


f 

I 


Function 

The  upper  range  for  the  trans- 
formation of  the  line  values 
(y-axis) ; T is  a floating- 
point number. 

The  lower  range  for  the  trans- 
formation of  the  line  values 
(y-axis) ; T is  a floating- 
point number. 

The  transformed  data  will  be 
rescaled  to  the  range  of  XLOW, 
XHIGH , YLOW,  and  YHIGH . (See 
SIZE  control  card.) 

Replaces  first  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Replaces  second  header  line 
with  the  indicated  characters 
in  the  parameter  field. 

Prints  the  indicated  12  charac- 
ters in  the  right  corner  of  the 
heading  in  place  of  the  current 
date. 

N»  or  YMAX  is  input,  all  four  param- 


TABLE  16-1.—  Concluded 


Keyword 

Parameter  and 
default  values 

Function 

SYMBOLS 

S1'S2'S3' * * * 'Sk 
k<32 

Default:  Two  sets 

of  10  symbols 
overprinted 

Character  set  separated  by 
commas,  with  a maximum  of 
32  characters.  The  number  of 
symbols/2  determines  the  num- 
ber of  bin  levels.  The  first 
set  of  symbols  is  overprinted 
by  the  second  set.  A blank  is 
not  a legitimate  character. 

MODULE 

Blank 

Initiates  the  input  of  the 
module  STAT  deck  which  immedi- 
ately precedes  this  card  (see 
section  3. 1.4.1  for  format). 

COMMENT 

Any  60  characters 
beginning  in  column  11 
Default:  No  comments 

printed 

Prints  a comment  line  using 
the  60  characters  found  in  the 
parameter  field. 

♦END* 

Blank 

Signals  the  end  of  the  control 
cards . 

$END* 

Blank 

Signals  the  end  of  all  card 
input  for  the  processing 
function . 

j 0 ASG  D= (NDIM  file  number) 

@ ASG  E | Assigns  PLOTAP  file  to  Fastrand 

RUN  — 

(Front  of  dock) 

(a)  For  independent  execution. 

Figure  16-2.—  Deck  setup  for  the  SCTRPL  processor. 
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Figure  16-3.—  Sample  program  listing  for  the  SCTRPL  processor  using  defaults. 
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Figure  16-4.—  Sample  program  listing  and  output  for  the  SCTRPL  processor  using  options 
and  executing  back  to  back  with  ISOCLS,  DATA-TR,  STAT,  and  NDHIST. 
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Figure  16-4.—  Continued 
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Figure  16-4.—  Continued. 
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Figure  16-4.—  Concluded 


17.  SYSTEM  RESTRICTIONS 


The  system  is  limited  in  every  processor  to  processing  no  more 
than  30  channels  of  data.  The  MSS  data  tape  (DATAPE)  may  have 
more  than  30  channels,  but  for  processing  purposes  a subset  of 
those  channels  must  be  selected  via  the  CHANNELS  control  card. 

A maximum  of  60  categories,  classes,  and  subclasses  may  be  proc- 
essed. However,  the  maximum  number  of  channels  and  subclasses 
may  not  be  processed  at  one  time.  The  arrays  within  the  system 
are  dimensioned  variably  according  to  user  requests.  The  amount 
of  storage  available  will  not  accommodate  the  arrays  which  are 
dimensioned  number  of  subclasses  by  number  of  channels,  if  both 
maximums  are  used.  Restrictions  under  the  STAT,  SELECT,  and 
CLASSIFY  processors  allow  the  user  to  compute  approximately 
whether  or  not  the  numbers  of  channels  and  subclasses  selected 
are  acceptable.  When  core  storage  requirements  are  exceeded, 
a diagnostic  message  is  printed,  and  the  user  must  reduce  his 
requirements  in  order  to  get  a successful  execution. 


18.  SYSTEM  DIAGNOSTIC  MESSAGES 


The  diagnostic  messages  listed  in  this  section  are  printed  by 
utility  routines  used  by  more  than  one  processor.  Diagnostics 
which  occur  only  in  a given  processor  are  listed  in  the  sec- 
tions for  individual  processors. 


18.1  CLSCHK  ROUTINE 
Message 

a.  **  CLSCHK**  - REQUESTED  SUBCLASS 
NO.  XXX  IS  NOT  AVAILABLE  IN 
INPUT  STATISTICS  — REQUEST 
IGNORED. 


b.  **  CLSCHK**  - REQUESTED  SUBCLASS 
NO.  XXX  FOR  GROUP  NO.  XXX  IS  NOT 
AVAILABLE  IN  INPUT  STATISTICS 
FILE. 


Explanation 

Either  by  the  SUBCLASS  control  card 
or  by  default,  a subclass  number  has 
been  requested  to  be  used  in  classi- 
fication which  is  greater  than  the 
largest  subclass  number  available  in 
the  input  training  subclasses.  The 
CLASSIFY  processor  ignores  the 
requested  subclass  number  and  deletes 
it  as  a possibility  for  use  in 
classification. 

A subclass  number  input  by  either  the 
SUBCLASS  or  the  GROUP  control  <j:ard  is 
greater  than  the  largest  training  sub- 
class number  available.  The  requested 
subclass  will  be  deleted  and  ignored 
in  classification. 


18 . 2 FETCHK  (CLSCHK)  ROUTINE 


Message 

a.  *****  CLSFY /FETCHK  CHANNEL 

XXX  NOT  IN  TRAINING  DATA  

TRAINING  DATA  CHANNELS  ARE 


C1'C2'C3' 


N 


Explanation 

A channel  requested  on  the  CHANNELS 
control  card  or  in  the  B-matrix  input 
for  use  in  classification  is  not 
available  in  the  set  of  channels 
which  was  used  to  obtain  training 


& V 


I 


Message 


b.  *****  CHANNEL  XX  IGNORED  (NOT 
USED)  IN  CLASSIFICATION. 


C.  *****  b-MATRIX  CHANNELS  MUST  BE 
EQUAL  TO  OR  A SUBSET  OF  AVAIL- 
ABLE TRAINING  DATA  CHANNELS  

THE  INPUT  B-MATRIX  CHANNEL  SET 
IS  Bi,b2,...  bm. 

*****  TERMINATING  PROGRAM  EXECU- 
TION FROM  FETCHK  ***** 

18.3  LAREAD  ROUTINE 
Message 

ERROR  IN  FIELD  CARD  TERMINATING 


Explanation 

subclass  statistics.  The  available 
set  of  channels  in  the  training  sta- 
tistics is  printed  out  as  part  of  the 
diagnostic  message.  Messages  b and  c 
are  added  to  this  when  the  B-matrix 
is  not  involved  (b)  and  when  B-matrix 
channels  are  input  (c) . 

If  the  B-matrix  is  not  being  input  to 
the  CLASSIFY  processor  (i.e.,  B-matrix 
channels  are  not  involved) , the 
requested  channel  in  the  previous 
message  will  be  deleted  from  the  list 
of  channels  and  ignored  by  the  CLASSIFY 
processor.  In  this  case,  this  message 
is  added  to  the  previous  diagnostic 
message . 

If  the  B-matrix  is  input  to  the 
CLASSIFY  processor,  the  B-matrix 
channels  become  the  set  which  is  to 
be  used  in  classification;  and  if  one 
of  the  B-matrix  channels  is  not  avail- 
able in  the  training  subclass  statis- 
tics, the  processor  cannot  continue. 

In  this  case,  this  will  be  the 
remainder  of  error  message. 

Explanation 

A field  description  card  has  an 
incorrect  format.  All  vertices 
must  be  separated  by  coimnas  and 


HUN. 


Message 


Explanation 

enclosed  in  parentheses,  and  sample 
and  line  numbers  must  be  integers. 
The  card  which  caused  the  error  is 
printed  out  with  this  message. 


18.4  GRPSCN  ROUTINE 
Message 

/////  PROM  SUBR.  GRPSCN  

CLASS  XX  INCORRECT  — 

CLASS  XX  IGNORED. 

CARD  BEING  SCANNED  IS 
CCCC  •••  CCCC. 


18.5  CRDSTA  ROUTINE 
Message 

a.  EXCEEDED  CORE  LIMITS,  REDUCE  NO. 
OF  SUBCLASSES  OR  CHANNELS  EXIT- 
ING FROM  CRDSTA. 


b.  ERROR  IN  TRYING  TO  POSITION  STAT 
FILE  IN  CRDSTA. 


Explanation 

One  of  the  class  numbers  listed  on 
the  GROUP  control  card  has  one  or 
more  of  the  following  three  condi- 
tions: (1)  not  in  ascending  order, 

(2)  greater  than  the  largest  class 
number  allowable  (30) , or  (3)  has 
already  been  used  in  another  GROUP 
card.  The  erroneous  GROUP  control 
card  is  printed  as  part  of  the 
message.  The  processor  will  delete 
the  erroneous  class  number  from  the 
list  and  proceed  grouping  all  other 
listed  classes. 


Explanation 

The  combination  of  total  number  of 
channels,  subclasses,  and  training 
fields  must  be  reduced  to  fit  in  the 
internal  core  storage  available  to 
the  processor.  Total  storage  is 
10  600  locations. 

An  error  occurred  in  positioning  the 
SAVTAP  file,  and  no  statistics  were 
written.  Resubmit  the  run. 


18 . 6 REDSAV  ROUTINE 


Message 

a.  USER  HAS  REQUESTED  CHANNELS, 

SUBCLASSES , AND  CLASSES . 

THIS  COMBINATION  OF  STATS  WILL 
NOT  FIT  IN  CORE.  PLEASE  REDUCE 
REQUEST. 


b.  STAT  FILE  WAS  NOT  CREATED. 
EXITING  FROM  **REDSAV** 

C.  ERROR  IN  POSITIONING  STAT  FILE 
TO  FILE  . 

18.7  TAPERD  ROUTINE 
Message 

a.  A LINE  NO.  IS  LESS  THAN  OR  EQUAL 
ZERO. 

b.  DATA  ORDER  INDICATOR  = N.  DATA 
MUST  BE  ORDERED  BY  PIXEL. 

c.  ERROR  READING  FIRST  DATA 
RECORD  — ISTAT  = N. 


Explanation 

The  fixed  amount  of  internal  core 
storage  available  to  the  processor  for 
storing  class  descriptions,  number  of 
subclasses  in  each  class,  subclass 
descriptions,  field  information,  ver- 
tices, covariances,  means,  and  working 
area  has  been  exceeded.  The  total 
amount  of  storage  available  for  the 
above  information  is  10  600  locations.1 
Reduce  the  requested  combination. 

An  error  occurred  in  positioning  the 
SAVTAP  file,  and  no  statistics  were 
written . Resubmit  run , 

Same. 


Explanation 

The  first  line  number  on  the  data 
tape  is  less  than  or  equal  to  zero. 

Information  from  header  records  indi- 
cates data  are  not  ordered  samples  by 
channels. 

Error  occurred  while  trying  to  trans- 
mit first  record  from  MSS  DATAPE. 


The  equation  for  computing  the  required  storage  is: 

STORAGE  = 2 (number  of  classes)  + (number  of  subclasses)  + 4 (num- 
ber of  fields)  + 2 (total  number  of  vertices  for  all  the  fields) 

+ (number  of  subclasses  + 1) [ (number  of  channels) (number  of 
channels  + l)/2]  + (number  of  subclasses) (number  of  channels). 


Message 


d.  ERROR  WHILE  READING  DATA  RECORD. 

e.  FEATURE  NUMBERS  N AND  ABOVE  ARE 
NOT  ON  DATA  TAPE. 

f . FIELD  BOUNDARY  FOR  THIS  FIELD 
DEFINED  BEYOND  SCOPE  OF  DATA. 
THIS  FLIGHT  LINE  CONTAINS  N SCAN 
LINES . 

g.  FLDINT  MUST  BE  CALLED  TO  INI- 
TIALIZE PARAMETERS  FOR  A NEW 
FIELD. 

h.  LAST  SCAN  LINE  READ  N ISTAT=S. 


i.  NO.  OF  BITS/PIXEL  = N. 

ONLY  8 BITS  ACCEPTABLE  AT  THIS 
TIME. 


j.  NO.  OF  RECORDS  PER  DATA  SET  = N. 
MUST  BE  LESS  THAN  OR  EQUAL  TO  15. 


Explanation 

ISTAT  is  the  status  word  for  the  unit 
trying  to  read  the  tape  (see  mes- 
sage h) . 

Error  occurred  while  trying  to 
transmit  record  from  MSS  DATAPE. 

User  has  requested  a channel  not  on 
MSS  DATAPE. 

User  has  requested  scan  line  not  on 
MSS  DATAPE. 


For  every  field  input  there  must  be 
a call  to  FLDINT  to  reset  parameters 
for  positioning  the  MSS  DATAPE. 

Gives  the  number  of  last  scan  line 
read  before  problem  in  reading  data 
tape  occurred.  ISTAT  is  status  word 
for  the  unit  reading  the  MSS  DATAPE. 
For  example, 

ISTAT  * -1  (transmission  not  complete) 
ISTAT  = -2  (read  through  end  of  file) 
ISTAT  = -3  (device  error) 

ISTAT  = -4  (transmission  aborted) 

According  to  the  header  record,  the 
samples  on  the  MSS  DATAPE  do  not 
equal  eight  bits.  It  is  assumed  that 
the  header  record  is  in  error,  and 
execution  continues. 

One  data  set  cannot  contain  more  than 
15  records. 


rxje: 


1 


Message 

K.  ONLY  ONE  OR  LESS  RECORDS  PER 

CHANNEL  ACCEPTABLE  AT  THIS  TIME. 

£.  CHECK  THE  FOLLOWING  POSSIBLE 
ERRORS. 

1.  DATA  TAPE  IS  NOT  IN 
UNIVERSAL  OR  LARSYS  FORMAT. 

2.  IF  DATA  TAPE  IS  9-TRACK, 

THE  - ASG  - CARL  SHOULD  HAVE 
AN  - N - OPTION  AND  A MES- 
SAGE TO  OPERATOR  SHOULD  BE 
ON  588  FORM. 

3.  IF  DATA  TAPE  WAS  GENERATED 
ON  A MACHINE  OTHER  THAN 
THE  1108,  THE  - ASG  - CARD 
SHOULD  HAVE  AN  - A - OPTION. 

m.  UNRECOVERABLE  ERROR  READING 
HEADER  RECORD. 

n.  INTERNAL  DIMENSIONS  TOO  SMALL 

FOR  DATA.  NO.  OF  CHANNELS  ON 
DATA  TAPE  = XXXXXXX,  NO.  OF 
POINTS/CHANNEL  = . 


18.8  TAPWRT  ROUTINE 
Message 

a.  ABORTED  WHILE  IN  WRTHDR  PART  OF 
SUBROUTINE  TAPWRT  BECAUSE  OF  BAD 
TAPE. 

b.  DATA  SET  N WAS  BEING  WRITTEN 
WHEN  PROGRAM  ABORTED. 


Explanation 

All  of  the  samples  of  one  channel  must 
be  contained  within  one  record. 

The  MSS  DATAPE  must  be  in  LARSYS  II, 
LARSYS  III,  or  Universal  format. 
Additional  options  must  be  present 
on  the  ASG  card  if  tape  is  nine-track 
or  was  generated  on  a computer  other 
than  the  Univac  1108. 


Error  occurred  while  trying  to  read 
header  record. 

The  maximum  record  size  of  the  data 
record  exceeded  6800  words. 


Explanation 

Job  aborted  while  trying  to  write  the 
header  record. 

Job  aborted  while  trying  to  write 
data  set  N in  Universal  format. 


V -.  -v 

4 


***'  - 


Message 

c.  RECORD  N WAS  BEING  WRITTEN  WHEN 
PROGRAM  ABORTED. 

d.  USER  NSAMP  IS  TOO  LARGE,  THUS 
NSAMP  WAS  RESET  TO  2998. 


18.9  UNPAK1  ROUTINE 
Message 

*****  error  FROM  SUBR.  UNPAK1 

XX  BITS  EXCEEDS  3 2 -BIT 

WORD  LENGTH. 


Explanation 

Job  aborted  while  trying  to  write 
record  N in  LARSYS  II  format. 

User  requested  too  much  data  to  be 
packed  into  one  record.  The  maximum 
number  of  samples  per  record  for 
Universal  format  is  2998  samples. 


Explanation 

The  header-unpacking  routine 
UNPAK1  has  been  sent  (via  a 
calling  argument)  a byte  length 
exceeding  32  bits.  The  routine 
rejects  any  byte  length  greater 
than  the  3 2 -bit  size.  The 
header  record  of  the  MSS  DATAPE 
will  not  unpack  when  this  con- 
dition is  detected. 
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APPENDIX  A 


LARSYS  II  (OR  III)  FORMAT  FOR  AN  MSS 
DATA  STORAGE  TAPE 

This  is  the  second  and  third  versions  of  the  format  used  in 
Purdue's  LARSYS.  The  only  difference  in  the  second  and 
third  version  of  the  format  is  one  word  in  the  header  record. 
That  difference  is  transparent  to  this  system. 

There  are  four  types  of  (physical)  records  on  the  Multi- 
spectral  Scanner  Data  Storage  Tapes.  They  are: 


1.  ID  record  - 200  full  words  fixed  length 

2.  Data  record  - variable  length 

3.  End-of-Tape  records  - 200  full  words  fixed  length 

4.  End-of-File  records  - IBM  Standard 


A Multispectral  Scanner  Data  Storage  Tape  contains  one  or 
more  data  runs  consisting  of  an  ID  record,  several  data 
records  and  an  End-of-File  record.  After  the  last  data 
record  on  the  tape,  an  End-of-Tape  record  and  two  End-of-File 
records  are  written  on  the  tape. 

For  the  purposes  of  this  presentation,  a 'word'  is  defined 
to  be  32  bits  and  a byte  to  be  8 bits.  Further  details 
regarding  the  physical  records  follows: 


1.  ID  record 

ID  Cl) 

ID  (2) 

ID  (3) 


ID  (4) 


ID  (5) 

ID  (6) 

ID (7-10) 


(200  full  words  fixed  length) 


FORMAT  DESCRIPTION 


Integer 

Integer 

Integer 


Integer 


Integer 

Integer 

Alpha- 

numeric 

(4A4) 


LARS  Tape  Number  (e.g.,  1,  17, 

102,  etc.) 

File  number  on  this  tape 
Run  number  (8  digits  aabbbbcc) 
aa  - last  2 digits  of  the  Year 
data  was  acquired 
bbbb  - running  serial  number  for 
the  year  data  was  taken 
cc  - uniqueness  digits  for  runs 
which  would  otherwise  have 
the  same  run  number 
Continuation  Code 
ID (4)  - 0 means  the  first  line  of 
data  follows  this  ID 
record 

ID (4)  = X means  that  the  data 
following  this  ID 
record  is  a continuation 
of  a flight  line  started 
tape  X 

Number  of  Data  Channels  (Spectral 
bands)  on  tape  (30  maximum) 

Number  of  Data  Samples  per  channel 
per  scan  line 

Flightline  Identification  (16 
characters) 


FORMAT 

DESCRIPTION 

ID  (11) 

Integer 

Month  data  was  taken 

ID (12) 

Integer 

Day  data  was  taken 

ID  (13) 

Integer 

Year  data  was  taken 

ID (14 ) 

Alpha- 

numeric 

(1A4) 

Time  data  was  taken 

ID ( 15 ) 

Integer 

Altitude  of  aircraft 

ID  (16) 

Integer 

Ground  heading  of  aircraft 

ID  (17- 

19) 

Alpha- 

numeric 

(3A4) 

Date  data  run  was  generated  on 
this  tape  (12  characters) 

ID  (20- 

50) 

Integer 

All  zero  (to  be  defined  later) 

ID  (51) 

Real 

Lower  limits  in  micrometers  of 
first  spectral  band  on  tape 

ID (52) 

Real 

Upper  limits  in  micrometers  of 
first  spectral  band  on  tape 

ID  ( 53) 

Real 

The  suggested  value  of  "CO" 
calibration  pulse 

ID  (54 ) 

Real 

The  suggested  value  of  "Cl" 
calibration  pulse. 

ID ( 55) 

Real 

The  suggested  value  of  "C2" 
calibration  pulse. 

ID  (56- 

200) 

Real 

Repeat  of  ID (51-55)  for  ID (5) 
channels  in  order  of  appearance 
in  Data  Records. 

ID  (51- 

200) 

Real 

= 0.0  if  Data  Channels  do  not 
exist. 

t 


Data  Record: 

Each  data  record  will  contain  one  scan  line  of  data  from 
ID (5)  (see  ID  Record)  channels.  The  first  half  word 
(2  bytes)  of  the  record  will  be  the  record  number*  The 
second  half  word  (2  bytes)  will  be  the  roll  paramater 
which  is  a number  indicating  relative  roll  of  the 
aircraft  for  this  scan  line  of  data.  If  the  roll  para- 
meter is  -32.767,  the  data  for  the  given  line  does  not  exist. 
If  the  roll  parameter  has  not  been  calculated,  it  will  be 
set  to  32,767.  The  fifth  byte  will  be  the  first 
data  sample  from  the  first  channel.  The  data  samples  are 
ordered:  Channel^,  Sample^  - Sample^;  Channel^,  Sample^- 

SampleN;  and  so  on  through  ID  (5) , Channels  and  ID  (6) 
data  samples  per  channel.  A data  record  (scanline)  will  be 
ID (5)*  ID (6) +4  bytes  long. 

The  data  from  each  channel  will  be  from  the  field  of  view 
of  the  scanner  except  the  last  six  bytes.  The  last  six 
are  calibration  data  in  the  order  of  appearance: 


1. 

C0 

"0"  or  dark 

level 

2. 

vc0 

Variance  of 

C0 

3. 

C1 

Calibration 

source 

C 

4. 

VC1 

Variance  of 

C1 

5. 

C2 

Calibration 

source 

C 

6. 

VC2 

Variance  of 

C2 

where  = Calibration  value  i and  VC^  = calculated  variance 
of  calibration  value  i. 


"r 


On  good  data  records  all  8-bit  data  and  calibration  values  will 
be  integers  in  the  range  of  0 to  255  with  no  sign  included 
in  the  eight  bits.  A sample  data  value  of  0 to  255  is  the 
result  of  the  8-bit  analog-to-digital  conversion  which  pro- 
duces the  multispectral  scanner  data  tape.  With  8-bit  A/D 
conversion,  data  values  range  between  0 to  255  with  0 
usually  indicating  low  relative  irradiance  and  255  usually 
indicating  high  relative  irradiance. 

End-of-Tape  Record: 

The  End-of-Tape  Record  is  very  similar  to  the  ID  Record  with 
200  full  words  in  the  following  format. 


WORD 

FORMAT 

DESCRIPTION 

ID  (1) 

Integer 

LARS  tape  number 

ID  (2) 

Integer 

File  number  on  this  tape 

ID  (3) 

Integer 

Set  equal  to  zero 

ID  (4 ) 

Integer 

Continuation  Code 
ID (4)  = 0 means  end  of  data 
ID (4)  = X means  data  in  previous 
file  is  continued  on 
tape  X 

ID  (5-50) 

Integer 

All  zero  (may  be  defined  later) 

ID ( 51-200) 

Real 

0.0  (may  be  defined  later) 

•*  v 
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APPENDIX  B 


UNIVERSAL  FORMAT  FOR  AN  MSS  DATA  STORAGE  TAPE 

This  is  an  adaptation  of  the  Universal  Data  Tape  Format  as 
defined  in  the  Earth  Resources  Data  Format  Control  Book,  Vol.  1 
(ref.  12,  section  7). 


Ground  Rules 


The  ground  rules  for  the  UNIVERSAL  format  as  accepted  by 

all  the  processors  within  this  system  are  as  follows: 

• 8 bits  = 1 byte 

• The  header  record  is  the  first  record  on  a tape. 

• The  header  record  is  fixed  length  equal  to  3060  bytes. 

• Data  following  the  header  will  be  arranged  by  data  sets, 
where  a data  set  is  defined  as  the  ancillary  data  and 
all  of  the  video  data  for  one  scan  line  for  all  active 
channels. 

• Data  sets  will  be  recorded  in  variable  length  physical 
records,  not  to  exceed  3000  bytes  of  information  per 
record.  Note,  since  3000  bytes  is  not  compatible  with  the 
word  lengths  of  all  computers,  the  computer  generating 
the  tape  will  add  a sufficient  number  of  fill  zeros  to 
the  end  of  the  data  to  make  the  record  length  divisible 

by  32,  36,  48,  and  60  bits  (180  bytes).  Therefore,  it 
is  possible  to  have  a max  physical  record  length  of  3060 
bytes,  but  under  no  condition  will  data  exceed  3000  bytes. 


— 


• Data  sets  will  be  packed  into  consecutive  physical  records 
of  equal  length.  Under  no  condition  will  a data  set  begin 
in  the  middle  of  a physical  record  unless  the  data  set 
can  be  completed  in  that  record.  If  two  or  more  records 
are  needed  for  the  data  set,  the  data  set  will  be  divided 
but  under  no  condition  will  the  data  for  a video  channel 
begin  in  the  middle  of  a physical  record  unless  the  data 
for  that  video  channel  can  be  completed  in  that  record. 
Consequently,  data  sets  which  are  lengthy  will  be  divided 
so  that  the  ancillary  block  and  video  data  from  an  inte- 
gral number  of  channels  will  be  in  one  record  and  remain- 
ing video  data  will  follow  in  succeeding  records  with  an 
integral  number  of  channels  per  record.  Fill  zeros  will 
be  supplied  at  the  ends  of  the  records  as  required  to 
satisfy  the  equal  length  contraint  noted  above. 

• All  data  in  the  header  record  and  ancillary  blocks  will 
be  in  binary. 

• The  tape  format  will  be  as  follows: 

Header  Record 
IRG* 


Ancillary  Block 
Video  Block 


Data  Set 


IRG** 


Ancillary  Block 
Video  Block 


Data  Set 


IRG 


EOF 


* IRG  = Inter  Record  Gap  - always  follows  the  header  record. 
**  An  IRG  may  appear  between  the  ancillary  block  and  the  video 
block  so  that  the  recording  of  a data  set  requires  more  than 
one  physical  record;  or  a physical  record  may  contain  two  or 
more  data  sets,  not  separated  by  any  IRG.  See  ground  rules 

above  and  data  set  description  following  for  criteria  deter- 
mining the  placement  of  IRG's. 


Header  Record 


Although  the  header  record  is  3060  bytes  in  length,  only  a 
portion  of  the  information  is  pertinent  to  the  system  at  this 
time.  A general  description  of  the  data  that  is  unpacked 
by  the  TAPERD  routines  is  as  follows: 


NO.  OF 

BYTE  NO.  DESCRIPTION  BYTES 

89  Processing  flag  1 

0 = Raw  Data 

1 = Processed  data  from  computing  system 

90  No.  of  channels  in  this  job  1 

91  No.  of  bits  in  a picture  element  1 

(Must  be  8 at  this  time) 

92-93  Address  of  start  of  video  data  2 


gives  location  of  start  of  video  within 
scan. 


96-97 

100-101 

102 


No.  of  video  elements  per  scan  within  a 2 

single  channel. 

Physical  record  size  in  bytes  2 

This  number  must  be  a multiple  of  180  bytes. 

No.  of  channels  per  physical  record  1 

This  field  refers  to  the  second  and  sub- 
sequent records  within  the  recording  of 
a data  set.  Bytes  1785-1786  give  the 
number  of  channels  of  data  in  the  first 
record  of  a data  set. 

If  no.  elements  per  channel  greater  than 
3Kj  this  field  will  equal  0. 


BYTE  NO. 
103 


104 

105-106 

107 


108-109 


NO.  OF 

DESCRIPTION  BYTES 

No.  of  physical  records  per  scan  1 

per  channel.  This  field  is  used  only 
when  the  no.  of  elements  per  channel 
is  greater  than  3K.  Otherwise  it  is 
equal  to  0 . 


No.  of  records  to  make  a complete  data  set  1 

Length  of  ancillary  block  in  bytes  2 

Data  Order  Indicator  1 

0 = Video  ordered  by  channel. 

1 = Video  ordered  by  pixel 

Start  Pixel  No.  2 


Number  of  the  first  pixel  per  scan  on 
this  tape  referenced  to  original  image. 
The  first  pixel  in  the  original  image 
is  pixel  number  one. 


110-111  Stop  Pixel  No. 

Number  of  the  last  pixel  per  scan  on 
this  tape  referenced  to  original  image. 

1778  Number  of  Data  Sets  per  Physical  Record 

1785-1786  Number  of  channels  in  the  first  physical 

record  of  the  data  set 


2 


1 

2 


1787-1788  Total  number  of  bytes  per  scan  per  channel  2 


Data  Sets 


Ancillary  Block 

*»  The  first  block  of  a data  set  is  the  ancillary  block. 

• The  length  of  the  ancillary  block  is  variable,  with  the 
number  of  bytes  given  in  the  header  record. 


• The  first  word  (2  bytes)  of  every  record  is  a counter 
giving  the  number  of  the  physical  record  within  the  video 
data  set.  This  is  primarily  intended  for  use  in  data  sets 
that  are  greater  than  3000  bytes  long  and  therefore  require 
more  than  one  physical  record  for  recording.  This  word 
will  always  equal  Ml"  for  the  first  record  of  a data  set. 

• Bytes  3 through  6 will  contain  the  current  GMT  at  the  start 
of  this  data  set  recorded  in  tenths  of  milliseconds. 

• Bytes  7 through  70  will  indicate  channel  status  for  this 
scan,  one  byte  per  channel,  where  the  LSB  = 0 indicates  the 
channel  is  sync,  and  the  LSB  = 1 indicates  the  channel  not 
in  sync, 

• 71-72  contain  the  scan  line  number.  This  will  be  an  arbi- 
trary but  sequential  count  for  each  scan  line  that  appears 
in  the  data  run. 

• Bytes  73  through  N will  be  dependent  on  whether  this  job 
contains  raw  or  processed  data.  (See  byte  89  in  the  header 
record.)  The  value  of  N will  be  given  in  bytes  105  and 
106  in  the  header  record  and  will  always  be  equal  to  or 
greater  than  70. 

• If  this  job  contains  raw  data,  bytes  73  through  N will 
contain  the  housekeeping  data  channel  from  the  sensor, 
if  one  is  available. 

• A job  containing  processed  data  will,  in  addition  to  the 
70  bytes  of  ancillary  data  already  described,  contain, 
at  a minimum,  the  following  pieces  of  information: 

• Latitude  of  the  aircraft  or  of  the  center  of  the  image 
from  EREP  or  satellite  in  binary. 

• Longitude  of  the  aircraft  or  of  the  center  of  the  image 
from  EREP  or  satellite  in  binary. 


• Altitude  in  meters  recorded  in  binary. 

• Heading  in  tenths  of  a degree. 

• Ground  speed  in  meters  per  second. 

• Roll  - Defined  in  specific  formats,  following. 

• Pitch  - Defined  in  specific  formats,  following. 

• Yaw  - Defined  in  specific  formats,  following. 

• Sun  angle. 

The  specific  formats  for  each  sensor  (following  in  this 
section)  shall  provide  where  this  data  will  appear  in 
the  format. 

Other  parameters  may  be  added,  if  required,  with  the  length 
of  the  ancillary  block  given  in  the  header. 

Video  Data 

• Following  the  ancillary  block  in  each  data  set  will  be  the 
video  data  from  all  of  the  active  channels  for  one  scan. 

The  video  data  from  all  of  the  active  channels  for  one 
scan  will  comprise  a video  block. 

• Video  blocks  within  a data  run  will  always  contain  the 
same  number  of  video  channels. 

• Each  video  block  will  be  the  same  number  of  bytes  in  length. 
If  video  data  is  not  available  to  fill  a block,  fill  zeros 
will  be  added  to  make  it  the  same  length  as  preceding  video 
blocks. 

• Video  data  less  than  8 bits  per  pixel  will  be  packed,  right 
justified,  in  an  8-bit  byte  with  zeros  added  to  the  left. 


• Video  data  greater  than  8 bits  per  pixel  will  be  packed , 
right  justified,  in  as  many  8-bit  bytes  as  necessary  to 
hold  the  pixel,  with  zeros  added  to  the  left. 

• If  this  tape  contains  raw  data,  the  PCM  sync  words  asso- 
ciated with  the  video  data,  if  any,  will  be  included  with 
the  video  data  on  this  tape.  If  this  tape  contains  pro- 
cessed data,  no  sync  words  will  be  present. 

• Calibration  data  that  is  associated  with  each  scan  within 
each  channel  will  be  included,  if  this  tape  contains  raw 
imagery  data,  in  the  same  sequence  as  it  appears  in  the 
data  stream  on  the  flight  tape.  If  this  tape  contains 
processed  imagery  data,  the  appearance  of  the  calibration 
data  will  depend  on  the  specific  sensor  requirements  and 
will  be  specified  in  the  respective  format  following  in 
this  document. 

• The  combined  length  of  the  ancillary  block  and  the  video 
block  will  determine  the  relationship  between  data  sets 
and  physical  records.  Some  data  runs  may  contain  data  sets 
which  are  so  small  more  than  one  can  be  packed  into  one 
physical  record.  Others  may  contain  data  sets  which  will 
require  a whole  physical  record  for  each.  Still  others 
may  contain  data  sets  which  are  so  long  that  each  data 

set  will  require  two  or  more  physical  records. 

• Data  sets  will  be  packed  in  physical  records  depending  on 
the  length  cf  the  data  set.  The  ancillary  block  will  always 
appear  in  the  first  physical  record  per  data  set.  Follow- 
ing the  ancillary  block,  as  many  complete  channels  in  this 
data  set  will  be  recorded  as  will  fit  in  up  to  3000  bytes. 

If  the  data  set  is  too  long  to  be  recorded  in  one  physical 
record,  the  second  and  subsequent  records  will  begin  with 
the  next  active  channel  in  the  data  set. 


• If  a video  block  is  divided  between  more  than  one  record 
for  recording,  the  number  of  data  channels  in  the  first 
record  may  vary  from  the  number  of  channels  in  the  second 
and  successive  records;  however,  the  number  of  channels  in 
all  records  following  the  first  per  data  set  will  always 
be  the  same.  The  number  of  channels  in  the  first  record  and 
the  number  in  successive  records  will  be  given  in  the  header 
record.  In  records  following  the  first,  if  video  data  is 
not  available  so  as  to  allow  all  records  to  contain  the 
same  number  of  channels,  fill  zeros  will  be  added  in  lieu 
of  the  video  data  in  order  to  make  all  records  the  same 
length.  In  addition,  fill  zeros  will  be  added  to  either 
the  first  record  or  all  of  the  successive  records,  depend- 
ing on  which  is  shorter,  so  as  to  make  all  of  the  records 
the  same  length. 


• The  only  arrangement  of  pixels  within  a scan  of  data  will 
be  by  channel . The  Universal  format*  will  be  as  follows: 


LINE  1 


CHANNEL  1 


CHANNEL  2 


CHANNEL  M 


PIXEL  1 
PIXEL  2 


PIXEL  N 
PIXEL  1 
PIXEL  2 


PIXEL  N 


PIXEL  1 


PIXEL  N 

* If  this  taoe  contains  raw  imagery  data,  the  PCM  sync  words,  if 
any,  that  are  associated  with  the  data  on  the  flight  tape  will  be 
included  with  the  data. 


PIXEL  1 


LINE  2 


CHANNEL  1 


CHANNEL  M 


PIXEL  N 


PIXEL  1 


PIXEL  N 


vis**®*;** 


■•v 
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APPENDIX  C 


MAPTAP  FORMAT 

The  tape,  MAPTAP,  is  output  by  the  processor  CLASSIFY.  It 
contains  the  statistics  actually  used  in  classification,  the 
training  field,  category,  class,  and  subclass  information, 
and  the  classified  data. 


Each  file  consisting  of  the  following  types  of  records: 


Repeated  for  each 
classified  field 


4 - run  header  records 

{1  - field  header  record 

N - data  records 
1 - end-of-field  record 

1 - end-of-run  record 

end-of-file  mark 


All  records  are  written  with  a nonformatted  FORTRAN  write 
statement , 


RUN  Header  Record  1 

WRITE (MAPTAP) (DATE (I) ,1=1,2) , BMFLG , BMCOMB , BMFEAT , NOCLS 2 , 
N0FLD2 , NOSUB 2 ,N0FET2 ,T0TVT2 , NOCAT, VARS Z 2 , 
(FETVC2 (I) , 1=1 ,N0FET2) 

FORTRAN  NAME 

AND  DIMENSION  DESCRIPTION 


DATE ( 2 ) 
BMFLG 

BMCOMB 


Date  classification  was  performed 

Flag  indicating  B-MATRIX  was  used 
in  classification 

No.  of  linear  combinations  in 
B-MATRIX 


FORTRAN  NAME 
AND  DIMENSION 


DESCRIPTION 


BMFEAT 

NO. 

the 

NOCLS2 

No. 

NOFLD2 

No. 

N0SUB2 

No. 

NOFET2 

No. 

TOTVT2 

No. 

NOCAT 

No. 

VARS  25  2 

Siz 

FETVC2 (NOFET2) 

Act 

of  channels  used  in  computing 
B-MATRIX 

of  classes 

of  training  fields 

of  subclasses 

of  channels  used  in  classification 
of  vertices  in  training  fields 
of  categories 

■ of  covariance  for  each  subclass 
tal  channels  used  in  classification 


RUN  Header  Record  2 


WRITE (MAPTAP) (CATNAM (I) ,I=1,N0CAT1) , (CLSMTX(I) ,I=1,N0CLS2) , 
(SUBNO(I) , 1=1 ,N0CLS2) , (SUBDES(I) ,I=1,N0SUB2) , 

( (FLDMTX (I , J) ,1=1,4) ,J=1,N0FLD2) , ( (VERTEX (I, J) , 
1=1,2) , J=1 ,T0TVT2) , (SUBCAT (I) ,1=1, N0SUB2) , 
(CLSVC2 (I)  ,I=1,N0SUB2)  , (KATNO  (I) ,I=1,N0CLS2)  , 
(KEPPTS (I) , 1=1 ,N0SUB2) 


FORTRAN  NAME 

AND  DIMENSION  DESCRIPTION 


CAT NAM  (NOCAT 1)  Category  names  (if  available) 

N0CAT1  = no.  of  categories  if 
CATEGORY  classifier  was  applied 
N0CAT1  = no.  of  classes  if  STANDARD 
classifier  was  applied. 

Class  names 


4ST, 


CLSMTX  (N0CI.S2) 


1 


FORTRAN  NAME 
AND  DIMENSION 


SUBNO (NOCLS2) 
SUBDES (NOSUB2) 
FLDMTX ( 4 , NOFLD2 ) 


VERTEX ( 2, TOTVT2) 


SUBCAT (NOSUB 2) 


CLSVC2  (NOSUB2) 
KATNO (NOCLS2) 
KEPPTS (NOSUB2 ) 


DESCRIPTION 


No.  of  subclasses  in  each  class 


Subclass  names 


Training  field  information 

1 - field  name 

2 - Class  number  field  belongs  to 

3 - Subclass  number  field  belongs  to 

4 - No.  of  vertices  in  this  field 


Vertices  for  all  the  fields;  ordered 
(sample , line) ^ , (sample , line) ^ • • • • 
(sample , line) T0TVT2 

Contains  the  category  number  to 
which  each  corresponding  subclass 
belongs 

Contains  the  class  number  to  which 
each  corresponding  subclass  belongs 


Contains  the  category  number  to 
which  each  class  belongs 


Contains  the  total  number  of  train- 
ing field  pixels  in  each  subclass 


RUN  Header  Record  3 


WRITE (MAPTAP) ( (COVMTX(I,J) ,I=1,VARSZ2) ,J=1,N0SUB2) 
( (AVEMTX(I , J) , 1=1 ,N0FET2) , J=1,N0SUB2) 


FORTRAN  NAME 
AND  DIMENSION 


DESCRIPTION 


COVMTX ( VARS  Z 2 , NOSUB  2 ) 


Original  or  B-transformed  covariance 
matrix  for  each  subclass 


AVEMTX (N0FET2 ,N0SUB2) 


Mean  vector  for  each  subclass 


f<s/ 


i ft-**- 


WRITE (MAPTAP) ( (COVMTX (I , J) ,I=1,VARSZ2) , J=1 ,NOSUB2) , 
( CON ( I ) , 1=1 ,NOSUB2) , (DET(I) ,I=l,NOSUB2) 


FORTRAN  NAME 

AND  DIMENSION  DESCRIPTION 

COVMTX ( VARS Z 2, NOSUB2)  'Modified'  Cholesky  Factorization 

of  the  covariance  matrix  for  each 
subclass 

Subclass  constants 

Determinant  of  covariance  matrix 
for  each  subclass 

Field  Header  Record 

WRITE (MAPTAP) (FLDINF(I) ,1=1,6) , PTS , LINES , FLDESC , NC , 

(VERTCS (I) , 1=1 ,NC) , (VERTCS(I+NC) ,NC=1,NC) 

FORTRAN  NAME 

AND  DIMENSION  DESCRIPTION 

FLDINF (6)  Rectangular  coordinates  surrounding 

the  field  classified, 

1 - line  start 

2 - line  stop 

3 - line  increment 

4 - sample  start 

5 - sample  stop 

6 - sample  increment 

No,  of  points  in  the  rectangular 
field  defined  in  FLDINF 


CON  (N0SUB2) 
DET (N0SUB2) 


PTS 


FORTRAN  NAMES 
AND  DIMENSION 

LINES 

FLDESC 

NC 

VERTCS (2 ,NC) 


DESCRIPTION 


No.  of  lines  in  the  rectangular 
field  defined  in  FLDinf 

Name  of  the  classified  field 

No.  of  vertices  in  the  classified 
field 

Vertices  for  the  classified  field. 
Vertices  are  ordered  (sample,  line)^ 
(sample,  line) 2 , •••  (sample,  line)NC 

Data  Record 


WRITE (MAPTAP) ILINE, (IR(I) ,I=1,PTS) , (VR(I) ,I=1,PTS) 

FORTRAN  NAME 

AND  DIMENSION  DESCRIPTION 

Line  number  in  reference  to  the 
multispectral  scanner  data  tape 

Subclass  number  to  which  each 
classified  data  point  belongs 

Liklihood  that  the  point  belongs 
to  that  subclass 

End-of-Field  Record 

An  end-of-field  record  has  the  same  format  as  a data  record 
with  ILINE=0. 

End-of-Run  Record 


ILINE 

IR(PTS) 

VR(PTS) 


An  end-of-run  record  has  the  same  format  as  field  header 
record  with  PTS=0. 


APPENDIX  D 


NDIM  FORMAT 
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APPENDIX  D 


NDIM  FORMAT 


The  interface  tape  NDIM  is  output  by  the  NDHIST  processor  and 
read  by  the  SCTRPL  processor. 


All  records  are  written  with  a nonformatted  Fortran  WRITE  state- 
ment. The  header  record  is  always  the  first  file  on  the  tape. 


The  format  of  the  tape  is  as  follows: 

HEADER  RECORD 


File  1 


Data  file  1 < 


END  OF  FILE  (EOF) 

RECORD  1 
RECORD  2 

RECORD  3 (optional) 
RECORD  4 
RECORD  5 
RECORD  6 

RECORD  7 (optional) 
EOF 


Data  file  N EOF 


The  contents  of  each  record  are  as  follows: 

Header  Record 


TOTMNS 

SIZE 

NOFET2 

(FETVC2 (I) , 1=1 ,N0FET2) 
NCLRCH 

( CLRVEC ( I ) , 1= 1 , NCLRCH ) 


— Total  number  of  means  computed 

— N0FET2/4 

— Number  of  channels  to  histogram 

— Actual  channels  to  histogram 

— Number  of  color  code  channels 

— Actual  color  code  channels 


Record  1 


N0FLD2 

N0SUB2 

T0TVT2 

NOVEC 


— Number  of  fields  histogrammed 

— Number  of  subclasses  histogrammed 

— Number  of  vertices 

— Number  of  unique  vectors 
histogrammed 

Record  2 


CLSVC2  — Class  name 

(SUBVC2 (I) ,I=1,N0SUB2)  -Subclass  names 

( (FIELDS (I, J) ,1=1,4) ,J=1,N0FLD2)  -Field  information 
( (VERTEX(I,J) ,1=1 ,2) ,J=1,T0TVT2)  -Field  vertices 


Record  3 (optional) 

(MEANS (I) , 1=1 ,TOTMNS)  —Mean  stats  for  input  fields 

Record  4 

( (PLOT(I,J) ,1=1, SIZE) ,J=1, NOVEC)  - Data  vectors 


Record  5 


(ID (I ) ,1=1, NOVEC)  - Class/subclass/field  the  data 

vectors  belong  to 

Record  6 

(COUNTR (I) , 1=1 , NOVEC)  — Number  of  occurrences  of  the 

data  vectors 

Record  7 (optional) 

(COLOR (I) , 1=1 , NOVEC)  — Color  codes  extracted  from  MSS 

data  tape 
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DESCRIPTION  OF  CLUSTER  IMAGE  DISPLAY 
WITH  COLOR  KEYS 


The  cluster  image  data  tape  output  by  the  ISOCLS  processor  con- 
tains the  mean  vector  to  which  each  corresponding  pixel  was 
assigned  during  clustering  and  a color  key.  The  color  key  is 
an  n number  of  square  images  of  10  samples  by  10  lines  in  dimen- 
sions. A color  code  square  is  composed  of  the  mean  vector  for  a 
given  cluster.  The  color  codes  are  ordered  according  to  cluster 
number  or  greenness.  The  greenness  ordering  (G)  is  a function 
of  the  four  Landsat  channels: 


Gi  f N -°'29lJl,N  “ 0,56u2,N  + °*60lJ3,N  + °*49p4,N 
M 


(E-l) 


GN  ” 2 Gi,N 


i=l 


where 

M = number  of  passes  for  multiregistered  Landsat  data 
= first  channel  of  pass  i 
V 2 = secone*  channel  of  pass  i 
= third  channel  of  pass  i 
4 = fourth  channel  of  pass  i 
i = number  of  pass 
N = cluster  number 


The  number  of  color  codes  per  scan  line  is  computed  by 


number  of  samples  per  scan  line 
11 


K 


(E-2) 


-"V » 


The  number  of  lines  required  to  display  the  color  codes  is 
computed  by 

L = [(number  of  g1.usters  - 11  + l]  x 11  (E-3 

The  clustered  field  and  color  key  are  separated  by  a scan  line  of 
zeros.  Each  color  code  square  is  separated  by  a l-by-10  vertical 
I*  line  of  zeros. 


The  data  are  output  in  LARSYS  II  or  Universal  format  (see 
( appendixes  A and  B,  respectively) . 

f 

i The  structure  of  the  file  is  as  follows. 


k 

N records 


HEADER  RECORD 
IRG1 

- Mean  vector  for  each  corresponding  pixel 


IRG 


(N  + 1)  records  — Scan  line  of  zeros 

IRG 

0 
0 

0 color 

• code  K 

• 

0 


IRG 

• 

1 (N  + L)  records  — color  code  (last  cluster) 


(N  + 2)  records  - 10  lines i 


0 0 

0 0 

color  0 color  0 

code  1 j code  2 j 

• • 

0 0 

i \ ‘ 


10  samples 


E-O-F2 


^Inter- record  gap. 
2End  of  file. 
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PLOTAP  FORMAT 

The  scatter  plot  image  tape,  PLOTAP,  contains  two-axis  color- 
coded  spectral  plot(s)  and  is  output  in.  the  Universal  format 
(appendix  B)  by  the  SCTRPL  processor.  Each  file  of  the  multi- 
file tape  contains  (1)  a single  scatter  plot  image,  of  which 
N - 1 channels  are  color  assignments  and  the  Nifc  channel  is  the 
frequency  channel,  and  (2)  a color  key,  unless  the  color  assign- 
ment is  the  radiance  values  of  the  output  pixel. 


The  color  key  is  an  N number  of  square  images  dimensioned 
10  samples  by  10  lines.  A color  code  square  is  composed  of  the 
colors  assigned  to  a given  pixel.  Each  color  code  is  ordered 
according  to  its  cluster  association;  i.e.,  the  color  code 
associated  with  cluster  1 is  output  first,  followed  by  the  color 
code  associated  with  cluster  2,  etc.  The  number  of  color  codes 
per  scan  line  is  computed  by 


o,  _ number  of  samples  per  scan  line 

11 


(F-l) 


The  number  of  lines  required  to  display  the  color  codes  is 
computed  by 


(number  of  clusters  - 
K 


+ 1 


X 11 


(F-2) 


The  scatter  plot  image  and  color  key  are  separated  by  a scan 
line  of  zeros.  Each  color  code  square  is  separated  by  a l-by-10 
vertical  line  of  zeros. 


c-  U 


The  dimensions  of  the  output  tape  are  user  controlled  via  the 
control  cards  SIZE  and  CHANNELS  (see  section  16,  table  16-1,  and 
section  15,  table  15-1,  respectively),  where 

(1)  number  of  samples  per  scan  lines  = XSIZ 

(2)  number  of  channels  = dimensions  of  color  pixel  plus  the 

frequency  channel 

(3)  number  of  scan  lines  = YSIZ  + L 


The  structure  of  the  file  is  as  follows: 


HEADER  RECORD 
IRG1 

YSIZ  records  — scatter  plot  image 


IRG 


or 


EOF  (if  color  key  is  omitted) 
(YSIZ  + 1)  records  — scan  line  of  zeros 

IRG 


( YXIZ  + 2)  records  — 10  lines  J 


color 
code  1 


0 

0 

0 color 
; code  2 


0 

0 

0 


0 

0 

0 color 
• code  K 


l 0 0 

^ t j >•  j 

10  samples 


0 

v 


IRG 

(YXIZ  + L)  records  — color  code  (last  cluster) 

E-Q-F2 


^Inter-record  gap 
2End  of  file. 
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Items  3 and  4 are  the  major  contributors  to  execution  time 


SAMPLE  EXECUTION  TIMES  FOR  GRAYMAP 
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Item  3 is  the  major  contributor  to  execution  time 
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SAMPLE  EXECUTION  TIMES  FOR  SELECT 


SAMPLE  EXECUTION  TIMES  FOR  SELECT 
(continued) 


SAMPLE  EXECUTION  TIMES  FOR  CLASSIFY 
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The  major  contributors  to  execution  time  are  items  1,  2,  3,  and  5.  Runs  1 and  2 show  the 
time  differences  in  the  category  classifier  and  the  standard  classifier. 
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The  major  contributors  to  execution  time  in  DISPLAY  are  items  4,  5,  and  8,  The  empirical 
thresholding  option  takes  about  twice  as  long  as  the  chi  square. 
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Items  3 and  4 are  the  major  contributors  to  execution  time 


SAMPLE  COMPUTER  RUNS  FOR  EXECUTING 
PROCESSORS  BACK  TO  BACK 
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SAMPLE  COMPUTER  RUNS  FOR  EXECUTING 
PROCESSORS  BACK  TO  BACK 


1.  Sample  run  1 executes  CLASSIFY  using  the  category 
classifier  then  executes  CLASSIFY  using  the  standard 
classifier.  Then  DISPLAY  is  executed  for  each  of 
the  classifications. 

2.  Sample  run  2 executes  ISOCLS,  CLASSIFY,  and  DISPLAY 
back  to  back. 

3.  Sample  run  3 executes  STAT,  SELECT,  and  DATA-TR 
back  to  back. 

4.  Sample  run  4 executes  CLASSIFY  and  DISPLAY 
back  to  back. 

5.  Sample  run  5 executes  STAT,  SELECT,  CLASSIFY,  and 
six  executions  of  DISPLAY  and  demonstrates  different 
ways  of  thresholding  and  input  of  test  fields  in 
DISPLAY . 

6.  Sample  run  6 demonstrates  the  use  of  the  DATAFILE 
control  card  in  accessing  several  files  of  a data 
tape  in  back-to-back  executions  of  HIST. 
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SAMPLE  RUN  1 

Run  1 executes  CLASSIFY  twice  and  DISPLAY  twice.  The  first 
execution  of  CLASSIFY  defines  two  categories  for  eight  classes 
and  nine  subclasses  of  line  C-l  statistics.  The  statistics  are 
input  by  card  deck  and  will  be  written  on  unit  A.  The  second 
execution  of  CLASSIFY  uses  the  same  statistics,  and  a standard 
classification  on  the  nine  subclasses  is  performed.  Both  exe- 
cutions of  CLASSIFY  go  on  the  tape  assigned  to  unit  B,  files  1 
and  2.  The  two  executions  of  DISPLAY  print  the  map  for  both 
files. 
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SAMPLE  RON  4 


The  CLASSIFY  and  DISPLAY  processors  are  being  executed  back  to 
back,  thereby  allowing  the  user  to  classify  the  data  and  apply 
thresholding  to  the  classification  results  in  one  execution. 

The  B-Matrix  deck  being  input  to  CLASSIFY  was  punched  by  a pre- 
vious run  of  SELECT;  the  threshold  values  being  input  to  DISPLAY 
will  be  the  actual  threshold  values  used  in  thresholding  the 
classified  data. 
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SAMPLE  RUN  5 


/ 

4 / , 

The  STAT,  SELECT,  CLASSIFY,  and  DISPLAY  processors  are  being 

executed  back  to  back,  thereby  allowing  the  user  to  do  the 
following : 

1.  Obtain  statistics  from  training  fields  (STAT) . 

» 2.  Select  the  best  of  four  channels  from  MSS  DATAPE  (SELECT). 

3.  Classify  the  MSS  data  (CLASSIFY). 

0 4.  Display  the  classification  results  six  times  exercising  the 

chi-square,  empirical,  and  no  thresholding  options  for  both 
the  training  and  the  test  fields. 
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SAMPLE  RUN  6 


f 

* 

This  run  demonstrates  the  use  of  the  DATAFILE  control  card  in 
accessing  several  files  of  a data  tape  in  back-to-back  executions 
of  HIST. 


I 


¥7? 


f l RUN  L7«O62tTF7.H*.1659#C0B7,Ci5.500  WILLS 

t ASC  C * V 1 5 A 9 5 

• ASG  2* V LC49R 

• XCT  CbR 
TRW  Z 


*s»  u.  4.  — jj  *i  —*  Hr  < uj  «*  •—  Ui<c<— 

j u.  * h # >-  z u-  • * * h as  Jit  • n • h ^ _j  u.  • ip  * »-  ir  jh  • 

ZacCiw/uja^Luuuir.  4ajacQJCj^cuicnualLUfl/u,a‘;uJaw/utCL<u 
• «l  U to  Q Q • H«***0»POC*  u.  U VI  D C # U.  «*  -*0^00* 


I 

I 

i: 


FILE3  IlflJ.U,  1) . (5.1) , (5,12), ( I. 12) 

VENC* 
t P*0 
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